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RESULT 1 
AAB24025 

ID AAB24025 standard; protein; 523 AA. 
XX 

AC AAB24025; 
XX 

DT 25-JAN-2001 (first entry) 
XX 

DE Human PRO1780 protein sequence SEQ ID NO: 13. 
XX 

KW Human; tumour; diagnosis; neoplastic disease; proliferation; cancer; 

KW identification; tumourigenesis ; anticancer; detection. 

XX 

OS Homo sapiens. 
XX 

PN WO200053750-A1. 



XX 

PD 14-SEP-2000. 
XX 

PF 02-DEC-1999; 99WO-US028551 . 
XX 

PR 08-MAR-1999; 99WO-US005028 . 

PR Ol-SEP-1999; 99WO-US020111 . 

PR 29-OCT-1999; 99US-0162506P . 

PR 30-NOV-1999; 99WO-US028313 . 

PR 01-DEC-1999; 99WO-US028634 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Botstein D, Goddard A, Gurney AL, Roy MA, Watanabe CK , Wood WI; 
XX 

DR WPI; 2000-594320/56. 

DR N-PSDB; AAC58107. 
XX 

PT Antibodies specific for PRO polypeptides, used to diagnose and inhibit 

PT the growth of tumors in mammals, and to identify inhibitors of PRO 

PT polypeptide activity or expression. 
XX 

PS .Claim 61; Fig 10; 226pp; English. 
XX 

CC The present invention describes an antibody that binds to a human protein 

CC (I) selected from: PR0381; PR01269; PRO1410; PR01755; PRO1780; PR03434; 

CC PR01927; PR03567; PR01295; PR01293; PRO1303; PR04344; PR04354; PR04397; 

CC PRO4407; PR01555; PRO1096; PRO2038; and PR02262. (I) has anticancer 

CC activity and can be used to diagnose tumours in mammals, by detecting 

CC complex formation when the antibody is contacted with test cells. 

CC Increased expression of genes encoding (I) can also be detected to 

CC diagnose tumours. Agents which inhibit the activity of (I), especially 

CC the antibodies, or an antisense oligonucleotide which hybridises to genes 

CC encoding (I), can be used to inhibit tumour growth, preferably by 

CC inducing cell death. Methods from the present invention can be used to 

CC identify compounds which inhibit the biological activity of (I) . AAC5 8019 

CC to AAC58102 represent PCR primers and hybridisation probes used in 

CC examples from the present invention for human PRO sequences. AAC58103 to 

CC AAC58122 and AAB24021 to AAB24040 represent human PRO polynucleotide and 

CC protein sequences given in the exemplification of the present invention 

XX 

SQ Sequence 523 AA; 

Query Match 100.0%; Score 2768; DB 3; Length 523; 
Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 

Db 61 GPFMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 



121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Qy 181 LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I II I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I! I I I 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFI7VKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLM 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANWIVT>WLPQSDLLAH 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 480 

I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 480 

Qy 48.1 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 4 81 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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XX 

AC AAY99419; 
XX 

DT 08-AUG-2000 (first entry) 
XX 

DE Human PRO1780 (UNQ842) amino acid sequence SEQ ID NO: 282. 
XX 



KW Human; PRO polypeptide; membrane bound protein; receptor; diagnosis; 

KW transmembrane; secretion; immunoadhesion; pharmaceutical; screening. 
XX 

OS Homo sapiens. 
XX 

PN WO200012708-A2. 
XX 

PD 09-MAR-2000. 
XX 

PF 01-SEP-1999; 99WO-US020111 . 
XX 

PR 01-SEP-1998; 98US-00987 16P . 

PR 01-SEP-1998; 98US-0098749P . 

PR 01-SEP-1998; 98US-00987 50P . 

PR 02-SEP-1998; 98US-00988 03P . 

PR 02-SEP-1998; 98US-009882 IP . 

PR 02-SEP-1998; 98US-0098843P . 
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PA (GETH ) GENENTECH INC. 
XX 

PI Baker K, Goddard A, Gurney AL, Smith V, Watanabe CK, Wood WI; 
XX 

DR WPI; 2000-237871/20. 

DR N-PSDB; AAA37101. 



PT New mammalian DNA sequences encoding transmembrane, receptor or secreted 

PT PRO polypeptides, useful for screening of potential peptide or small 

PT molecule inhibitors of the relevant receptor/ligand interactions. 
XX 

PS Claim 12; Fig 160; 773pp; English. 
XX 

CC AAA37022 to AAA37144 encode the new isolated human transmembrane, 

CC receptor or secreted PRO polypeptides given in AAY99340 to AAY99462. The 

CC transmembrane and receptor PRO proteins can be used for screening of 

CC potential peptide or small molecule inhibitors of the relevant 

CC receptor/ligand interactions. The polypeptides and nucleotide sequences 

CC encoding then have various industrial applications, including uses as 

CC pharmaceutical and diagnostic agents. AAA37145 to AAA37330 represent PCR 

CC primers and hybridisation probes used in the isolation of the PRO 

CC polypeptides from the present invention 

XX 

SQ Sequence 523 AA; 



Query Match 100.0%; Score 2768; DB 3; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
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MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 
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60 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
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CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 


Qy 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDWLAANVKIVDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 



Qy 

Db 



421 ALKMKQIMEDKRYKS7VAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I 

421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 



Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 3 




AAB66168 




ID 


AAB66168 standard; protein; 523 AA. 


XX 






AC 


AAB66168; 




XX 






DT 


02-APR-2001 


(first entry) 


XX 






DE 


Protein of the invention #80. 


XX 






KW 


Secreted; transmembrane; gene therapy. 


XX 






OS 


Unidentified. 


XX 






PN 


WO200078961- 


-Al. 


XX 






PD 


28-DEC-2000 




XX 






PF 


18-FEB-2000; 2000WO-US004342 . 


XX 






PR 


23-JUN-1999, 


; 99US-0141037P. 


PR 


20-JUL-1999, 


; 99US-0144758P. 


PR 


26-JUL-1999, 


; 99US-0145698P. 


PR 


01-SEP-1999, 


; 99WO-US020111. 


PR 


29-OCT-1999, 


; 99US-0162506P. 


PR 


30-NOV-1999, 


; 99WO-US028313. 


PR 


02-DEC-1999, 


; 99WO-US028551. 


PR 


16-DEC-1999, 


; 99WO-US030095. 


PR 


05-JAN-2000, 


; 2000WO-US000219. 


PR 


06-JAN-2000, 


; 2000WO-US000376. 


XX 






PA 


(GETH ) GENENTECH INC. 


XX 






PI 


Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 


PI 


Gao W, Goddard A, Godowski PJ, Grimaldi CJ, Gurney AL, Hillan KJ; 


PI 


Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK 


PI 


Williams PM, Wood WI ; 


XX 






DR 


WPI; 2001-071395/08. 


XX 






PT 


Secreted and transmembrane proteins and nucleic acids designated PRO, 


PT 


useful as hybridization probes, in chromosome and gene mapping and gene 


PT 


therapy. 




XX 






PS 


Claim 1; Fig 160; 787pp; English. 


XX 






CC 


The present 


invention relates to secreted and transmembrane proteins. 


CC 


These proteins and the DNA encoding them may be used as hybridization 



CC probes, in chromosome and gene mapping and in the generation of anti- 

CC sense RNA and DNA. They may also be used used to generate either 

CC transgenic animals or knockout animals which are in turn useful for 

CC development and screening of therapeutically useful reagents. The nucleic 

CC acids may also be used in gene therapy 

XX 

SQ Sequence 523 AA; 



Query Match 100.0%; Score 2768; DB 4; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 M 1 1 1 1 

IllllllllllllllllllllfllllllJIIIIIIIIIIIIIIIIIIIIIJIVIti 




Db 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 






1 1 1 1 1 1 1 II 1 1 1 1 1 I 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 




Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVWGGmEKPIKPVPQDLENFIAKFGDSGFVXVTL 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 


Qy 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVH 


360 






I | I I I I I I I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDWl^AANVKIVDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 


Qy 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 4 
AB033661 

ID AB033661 standard; protein; 523 AA. 
XX 

AC AB033661; 
XX 

DT 17-SEP-2003 (first entry) 
XX 



DE Novel human secreted and transmembrane protein PRO1780. 
XX 

KW Human; secreted and transmembrane protein; PRO; angiogenesis ; 

KW endothelial cell proliferation; wound healing; immune response; 

KW T-lymphocytes proliferation; neonatal heart hypertrophy; tumour; 

KW cardiac insufficiency disorder; calcium flux; inflammation; 

KW vascular endothelial growth factor-stimulated proliferation; 

KW mammalian kidney mesangial cell proliferation; Berger disease; 

KW nephropathy; Schanlein-Henoch purpura; celiac disease; Crohn's disease; 

KW dermatitis herpetiformis; diabetes; haemoglobin switch; insulinaemia ; 

KW pancreatic beta-cell precursor cell differentiation; thalassemias; 

KW obesity; auditory hair cell regeneration; hearing loss; bone disorder; 

KW cartilage disorder; sports injury; arthritis. 

XX 

OS Homo sapiens. 
XX 

PN US2003073130-A1. 
XX 

PD 17-APR-2003. 
XX 

PF ll-DEC-2001; 2001US-00015869 . 
XX 

PR 01-SEP-1998; 98US-0098716P. 

PR 01-SEP-1998; 98US-0098723P . 

PR 01-SEP-1998; 98US-0098749P . 

PR 01-SEP-1998; 98US-0098 750P . 

PR 02-SEP-1998; 98US-0098803P . 

PR 02-SEP-1998; 98US-0098821P . 

PR 02-SEP-1998; 98US-0098843P . 

PR 09-SEP-1998; 98US-0099536P . 

PR 09-SEP-1998; 98US-0099596P . 

PR 09-SEP-1998; 98US-0099598P . 

PR 09-SEP-1998; 98US-0099602P . 

PR 09-SEP-1998; 98US-0099642P . 

PR 10-SEP-1998; 98US-0099741P . 

PR 10-SEP-1998; 98US-0099754P . 

PR 10-SEP-1998; 98US-0099763P . 

PR 10-SEP-1998; 98US-0099792P . 

PR 10-SEP-1998; 98US-0099808P . 

PR 10-SEP-1998; 98US-00998 12P . 

PR 10-SEP-1998; 98US-00998 15P . 

PR 10-SEP-1998; 98US-0099816P . 

PR 15-SEP-1998; 98US-0100385P . 

PR 15-SEP-1998; 98US-010038 8P . 

PR 15-SEP-1998; 98US-0100390P . 

PR 16-SEP-1998; 98US-0100584P . 

PR 16-SEP-1998; 98US-0100627P . 

PR 16-SEP-1998; 98US-0100661P . 

PR 16-SEP-1998; 98US-0100662P . 

PR 16-SEP-1998; 98US-0100664P . 

PR 17-SEP-1998; 98US-0100683P . 

PR 17-SEP-1998; 98US-0100684P . 

PR 17-SEP-1998; 98US-0100710P . 

PR 17-SEP-1998; 98US-0100711P . 

PR 17-SEP-1998; 98US-0100919P . 

PR 17-SEP-1998; 98US-0100930P . 

PR 18-SEP-1998; 98US-0100848P . 



PR 


18- 


SEP- 
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PR 
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SEP- 
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PR 


30- 
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PR 
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OCT- 
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PR 
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-1998; 


PR 


06- 


-OCT- 
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PR 


07- 
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98US-0100849P. 

98US-0101014P. 

98US-0101068P. 

98US-0101071P. 

98US-0101279P. 

98US-0101471P. 

98US-0101472P. 

98US-0101474P. 

98US-0101475P. 

98US-0101476P. 

98US-0101477P. 

98US-0101479P. 
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98US-0102484P. 

98US-0102487P. 
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98US-0103711P. 
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98US-0105000P. 

98US-0105002P. 
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98US-0105694P. 

98US-0105807P. 

98US-0105881P. 
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98US-0106023P. 

98US-0106029P. 

98US-0106030P. 

98US-0106032P. 
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-2000; 


PR 
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98US-0106033P. 
98US-0106178P. 
98US-0106248P. 
98US-0106384P. 
98US-0108500P. 
98US-0106464P. 
98US-0106856P. 
98US-0106902P. 
98US-0106905P. 
98US-0106919P. 
98US-0106932P. 
98US-0106934P. 
98US-0107783P. 
98US-0108775P. 
98US-0108779P. 
98US-0108787P. 
98US-0108788P. 
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98US-0108867P. 
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98US-0108849P. 
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98US-0108851P. 
98US-0108852P. 
98US-0108858P. 
98US-0108904P. 
98US-0113296P. 
98US-0114223P. 
99WO-US000106. 
99US-0129674P. 
99US-0141037P. 
99US-0144758P. 
99US-0145698P. 
99WO-US020111. 
99WO-US021194. 
99US-0162506P. 
99WO-US028313. 
99WO-US028551. 
99WO-US030095. 
2000WO-US000219. 
2000WO-US000376. 
2000WO-US003565. 
2000WO-US004342. 
2000WO-US005004. 
2000WO-US005841. 
2000WO-US006884. 
2000WO-US013705. 
2000WO-US014042. 
2000WO-US014941. 
2000WO-US015264. 
2000WO-US023522. 
2000WO-US023328. 
2000WO-US030952. 
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-NOV- 


•2000; 


2 000WO- 


-USUoUo / J . 


01- 


-DEC- 
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2 000WO- 


-USUoz b / o . 
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■2001; 
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2 001WO- 


ttc c\ r\ a a a a 
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-JUN- 
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2 001WO- 


-US017800. 


20 


-JUN- 


-2001; 


2001WO- 


-US019692. 


29- 


-JUN- 


-2001; 


2001WO- 


-US021066. 


09 


-JUL- 


-2001; 


2001WO- 


-US021735. 


04 


-SEP- 


-2001; 


2001US- 


-00946374. 



PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 
PI Gao W, Goddard A, Godowski PJ, Grimaldi JC, Gurney AL, Hillan KJ; 
PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK; 
PI Williams PM, Wood WI; 
XX 

DR 4 WPI; 2003-585293/55. 
DR N-PSDB; ACD68440. 
XX 

PT Novel isolated PRO polypeptides e.g. PRO1130, PR01275, PR01418, PR01555, 
PT PR01787 that modulate glucose or free fatty acid uptake by skeletal 
■PT muscle cells, and are useful .for .treating .diabetes, .hyper- or hypo- 
Query Match 100.0%; Score 2768; DB 6; Length 523; 
Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I M I I I I I I I I I I! I I I I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M M 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I | | | | | | | | | | I I I I I I I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I M 
GPFMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 18 0 

I M I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

I I I I I I I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

LLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 30 0 

I | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSH^ 360 
I I I I II I I I II I I I I I I I I I I I I I I I M II I I I I II II I I I I I I I I I I I I I I I I I I I I I 
GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV11LAANVKIVDWLPQSDLLAH 
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61 
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181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 



360 



PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I M I I I I I I I I I I I I I I 

PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 



Qy 4 81 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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Human; secreted protein; transmembrane protein; PRO; vulnerary; cardiant 
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stimulated T-lymphocyte proliferation; immune response suppression; 
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KW 
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XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 
PI Gao W, Goddard A, Godowski PJ r Grimaldi JC, Gurney AL, Hillan KJ; 
PI Pan J, Paoni NF f Roy MA f Smith V, Stewart TA, Tumas D, Watanabe CK; 
PI Williams PM, Wood WI; 

XX 

DR WPI; 2003-492259/46. 
DR N-PSDB; ACH04542. 
XX 

PT Novel secreted and transmembrane polypeptides and polynucleotides 

PT encoding them useful for treating various cardiac insufficiency 

PT disorders, bone and/or cartilage disorders such as sports injuries and 

PT arthritis. 

Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVTiEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Qy 



181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 



Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEIWNAFAHLPQGVIWKC^ 360 

I I I I I I I I I I I I I I I i I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHIAANVK1VDWLPQSDLLAH 360 

Qy 361 P S I RL FVTH GGQN S IMEAT QHGVPMVGI P L FGDQ P ENMVRVEAKKFGVS I QLKKLKAET L 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLK7VETL 42 0 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I II I I 
Db 421 7VLKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I M I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 6 
AB033538 

ID AB033538 standard; protein; 523 AA. 
XX 

AC AB033538; 
XX 

DT 17^SEP-2003 (first entry) 
XX 

DE Novel human secreted and transmembrane protein PRO1780. 
XX 

KW Human; secreted and transmembrane protein; PRO; gene therapy; vaccine; 
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DR WPI; 2003-555602/52. 

DR N-PSDB; ADC18150. 
XX 

PT Novel isolated PRO polypeptides e.g. PR01491 and PR01571, useful in the 

PT preparation of a medicament for treating a condition responsive to PRO 

PT polypeptide, and as therapeutic agents e.g. vaccines. 
XX 

PS Claim 12; SEQ ID NO 282; 555pp; English. 
XX 

CC The invention relates to human PRO polypeptides and the polynucleotides 

CC encoding them. The sequences are useful in the preparation of a 

CC medicament for treating a condition responsive to a PRO polypeptide. The 

CC polypeptides are useful in a number of functional biological assays, as 

CC molecular weight markers for protein electrophoresis and as therapeutic 

Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNWMLNHKR 60 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

L S YVP VFRS LLT DHMD FWGRVKN FLMFFS FCRRQQHMQ ST FDNT I KEH FT EGS RPVL S HL 240 

I I I U I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I I I I I I I I I I I 

LSYVPVFRS LLTDHMD FWGRVKN FLMFFS FCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVXVTL 300 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLWL 300 

GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVIILAANVKIVDWLPQSDLL^ 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVliLAANVKIVDWLPQSDLLAH 360 

PS I RLFVTHGGQNS IMEAI QHGVPMVGI PLFGDQPENMVRVEAKKFGVS I QLKKLKAET L 42 0 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ri i i i i i i i i i i i i i i i i i i 

P S I RL FVTHGGQN S I MEAI QHGVPMVGI P L FGDQ P ENMVRVEAKKFGVS I QLKKLKAET L 42 0 

i\LKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 48 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I | | | | | I I I I 
ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 48 0 

HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 
HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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ADD70797 

ID ADD70797 standard; protein; 523 AA. 
XX 

AC ADD7 07 97; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE Human secreted/ transmembrane protein PRO178 0. 
XX 

KW Human; secreted protein; transmembrane protein; PRO; tumour; 

KW immune response; cardiac insufficiency disorder; calcium flux; 

KW umbilical vein endothelial cell; bone disorder; cartilage disorder; 

KW arthritis; wound healing; diabetes; skeletal muscle cells; obesity; 

KW Berger disease; nephropathy; Schonlein-Henoch purpura; coeliac disease; 

KW dermatitis; herpetiformis; Crohn's disease; thalassaemia . 

XX 

OS Homo sapiens . 
XX 

PN US2003099625-A1. 
XX 

PD 29-MAY-2003. 
XX 

PF 12-DEC-2001; 2 001US-0001538 6 . 
XX 
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PR 23-AUG-2000; 2000WO-US023522 . 

PR 24-AUG-2000; 2000WO-US023328 . 

PR 08-NOV-2000; 2000WO-US030952 . 

PR 10-NOV-2000; 2000WO-US030873 . 

PR 01-DEC-2000; 2000WO-US032678 . 

PR 28-FEB-2001; 2 001WO-US006520 . 

PR 01-MAR-2001; 2001WO-US006666 . 

PR 01-JUN-2001; 2001WO-US017800 . 

PR 20-JUN-2001; 2001WO-US019692 . 

PR 29-JUN-2001; 2 001WO-US021066 . 

PR 09-JUL-2001; 2001WO-US021735 . 

PR 04-SEP-2001; 2001US-00946374 . 
XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 

PI Gao W, Goddard A, Godowski PJ, Grimaldi JC, Gurney AL, Hillan K J; 

PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK; 

PI Williams PM, Wood WI ; 
XX 

DR WPI; 2003-874602/81. 

DR N-PSDB; ADD70796. 
XX 

PT Novel isolated PRO polypeptides e.g., PRO1130, PR01275, PR01418, PR01555, 

PT PR01787 affect glucose or free fatty acid (FFA) uptake by skeletal muscle 

PT cells and are useful for treating diabetes or hyper- or hypo-insulinemia . 
XX 

PS Claim 12; SEQ ID NO 282; 553pp; English. 
XX 

CC The invention relates to an isolated PRO polypeptide (secreted or 

Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I 

MAGQRVLLLVGFLLPGVLLSEA7VKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 
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GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

M | | | I I I I II I I I I I I I I I I M I I I I II I I I I I I I I I I I I I I I M I I I I I I I I I 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I II I I I I I I I I M I I I I I I I I I I I I I I I I 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

|| | | I I I I I I I I I I I I I I I I II II I I I I I M I I I I I I i I I I I I I I I I I I I I I I I 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

| I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 
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301 GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 
I I I I I I I II I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I IN 



Db 



301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQC^^ 360 



Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I 

Db 361 P S I RLFVTHGGQN S IMEAI QHGVPMVGI P LFGDQP ENMVRVEAKKFGVS I QLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

M I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Ml 

Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I M I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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ID 


ADD39874 standard; protein; 523 AA. 
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ADD39874; 
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DT 


15-JAN-2004 


(first entry) 
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Human secreted/ transmembrane protein PRO1780. 
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KW 


Human; secreted protein; transmembrane protein; PRO; tumour; 


KW 


immune response; cardiac insufficiency disorder; calcium flux; 


KW 


umbilical vein endothelial cell; bone disorder; cartilage disorder; 


KW 


arthritis; wound healing; diabetes; skeletal muscle cells; obesity; 
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Berger disease; nephropathy; Schonlein-Henoch purpura; coeliac disease; 
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dermatitis; herpetiformis; Crohn's disease; thalassaemia . 
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Homo sapiens. 
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XX 

PA (GETH ) GENENTECH INC. 
XX 

PI Baker KP, Botstein D, Desnoyers L, Eaton DL, Ferrara N, Fong S; 

PI Gao W, Goddard A, Godowski PJ, Grimaldi JC, Gurney AL, Hillan KJ; 

PI Pan J, Paoni NF, Roy MA, Smith V, Stewart TA, Tumas D, Watanabe CK; 

PI Williams PM, Wood WI ; 
XX 

DR WPI; 2003-755122/71. 

DR N-PSDB; ADD39873. 
XX 

PT New secreted and transmembrane PRO polypeptides useful for treating 

PT cancers, kidney disorders, Crohn's disease, diabetes mellitus, hyper- or 

PT hypo-insulinemia, sports injuries and arthritis. 

XX 

PS Claim 12; SEQ ID NO 282; 557pp; English. 
XX 

CC The invention relates to an isolated PRO polypeptide (secreted or 

CC transmembrane protein) having at least 80% amino acid sequence identity 

CC to an amino acid sequence chosen from 123 fully defined sequences as 

CC given in the specification (including their extracellular domains either 

CC or without their associated signal peptides. Also include are the 

CC nucleotide (NA) sequences encoding PRO, a vector comprising the PRO NA, a 

CC host cell comprising the vector, producing PRO, a chimaeric molecule 

CC comprising PRO fused to a heterologous amino acid sequence, and an anti- 

CC PRO antibody. Pro is useful as molecular weight markers for protein 

CC electrophoresis and also for chromosome identification. PRO is also 

CC useful for tissue typing. PRO and PRO NA are useful as hybridisation 

CC probes for a cDNA library to isolate the full-length PRO cDNA. PRO NA is 

CC useful for generating transgenic animals or knock-out animals which are 

CC useful in development and screening useful reagents. PRO NA is also 

CC useful in gene therapy. PR01244, PR01286 and PRO1303 polypeptides are 

CC useful for treating cancerous tumours. PRO1250, PR01418 and PRO1410 

CC polypeptides are useful for suppressing immune response. PR01246 

CC polypeptide is useful for treating cardiac insufficiency disorders. 

CC PR01246 polypeptide is also useful for treating tumours. PR01246 and 

CC PR01561 polypeptide are useful for stimulating calcium flux in human 

CC umbilical vein endothelial cells. PR01265, PRO1250 and PR01474 

CC polypeptides are useful for treating bone and/or cartilage disorders 

CC (e.g., arthritis) and wound healing. PRO1130, PR01275 and PR01418 

CC polypeptides are useful for treating diabetes in skeletal muscle cells 

CC and obesity. PR01265, PR01244 and PR01382 polypeptides are useful for 

CC treating Berger disease or other nephropathies associated with Schonlein- 

CC Henoch purpura, coeliac disease, dermatitis, herpetiformis or Crohn's 

CC disease. PR01478, PR01265, PR01412, PR01279, PRO1304, PRO1306, PR01418, 



CC PRO1410 and PR01575 are useful in treating thalassaemias . The present 

CC sequence represents a PRO protein of the invention. 

XX 

SQ Sequence 523 AA; 



Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 
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PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 
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421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 
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Db 


421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 
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HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 10 
ADD70320 

ID ADD70320 standard; protein; 523 AA. 
XX 

AC ADD70320; 
XX 

DT 15-JAN-2004 (first entry) 
XX 

DE Human secreted/transmembrane protein PRO1780. 
XX 

KW Human; secreted protein; transmembrane protein; PRO; tumour; 



KW immune response; cardiac insufficiency disorder; calcium flux; 

KW umbilical vein endothelial cell; bone disorder; cartilage disorder; 

KW arthritis; wound healing; diabetes; skeletal muscle cells; obesity; 

KW Berger disease; nephropathy; Schonlein-Henoch purpura; coeliac disease; 

KW dermatitis; herpetiformis; Crohn's disease; thalassaemia . 

XX 

OS Homo sapiens . 
XX 

PN US2003054406-A1. 
XX 



PD 


20- 


MAR- 


2003. 




XX 












PF 


06- 


DEC- 


2001; 2001US- 


00006818. 


XX 
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PR 
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0099763P. 


PR 


10- 


SEP- 
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0099792P. 


PR 


10- 


SEP- 
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; 98US- 


0099808P. 


PR 
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SEP- 


1998, 


; 98US- 


0099812P. 


PR 


10- 


SEP- 


1998, 


; 98US- 
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Query Match 100.0%; Score 2768; DB 7; Length 523; 

.Best Local Similarity 100.0%; Pred. No. 6.3e-278; 
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Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I 

Db 1 MAGQRVX,LLVGFLLPGVLLSEA7\KILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 
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PT Novel isolated PRO polypeptide, useful for treating cancerous tumors, 
PT cardiac insufficiency disorders, wound healing, diabetes mellitus, 
PT thalassemias. 
XX 

PS Claim 12; SEQ ID NO 282; 556pp; English. 
XX 

CC The invention relates to an isolated PRO polypeptide (secreted or 

CC transmembrane protein) having at least 80% amino acid sequence identity 

Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I 

Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I II I I II I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I 

Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I 

Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 



Qy 

Db 



241 
241 



LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 



300 
300 



Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSMWTCQNPEIFKEMNNAFAHLPQG 360 

Qy 361 P S I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQP ENMVRVEAKKFGVS IQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 ALKMKQIMEDKRYKSAAV7^VSVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 12 
ADD39397 
ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
KW 
KW 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 
PR 



ADD39397 standard; protein; 523 AA. 
ADD39397; 

15-JAN-2004 (first entry) 

Human secreted/transmembrane protein PRO1780. 

Human; secreted protein; transmembrane protein; PRO; tumour; 
immune response; cardiac insufficiency disorder; calcium flux; 
umbilical vein endothelial cell; bone disorder; cartilage disorder; 
arthritis; wound healing; diabetes; skeletal muscle cells; obesity; 
Berger disease; nephropathy; Schonlein-Henoch purpura; coeliac disease; 
dermatitis; herpetiformis; Crohn's disease; thalassaemia . 

Homo sapiens. 
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PT . New isolated PRO polypeptides such as PRO1560, PR0444, PRO1018, PR01773, 
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CC The invention relates to an isolated PRO polypeptide (secreted or 

Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I 

Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I II I I I I 

Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 
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Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 
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Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDWLAANVKIVDWLPQSDLLAH 360 

Qy 361 PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 420 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 

Db 361 PSIRLFVTHGGQNSIME^IQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
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Db 4 81 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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Query Match 100.0%; Score 2768; DB 7; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.3e-278; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 



Db 



1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MAGQRVLLLVGFLLPGVLLSE7UVKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 



Qy 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 




Db 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 



Qy 



121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I 
Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGHVLVTL 300 

Qy 301 GSMWTCQNPEIFKEIWNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDL^ 360 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSMWTCQNPEIFKE^NAFAHLPQGVIWK^^ 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 361 P S I RLFVTHGGQNS IMEAI QHGVPMVGI P LFGDQ PENMVRVEAKKFGVS I QLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 
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Job time : 62 sees 



GenCore version 5.1.6 
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Search time 22 Seconds 
(without alignments) 
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Perfect score: 
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Scoring table: 



US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS . 



. GKLLGMAVWWLRGARKVKET 523 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



389414 



Searched: 389414 seqs, 51625971 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/2/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/2/iaa/5B_COMB.pep: * 

3: /cgn2_6/ptodata/2/iaa/6A_COMB.pep: * 

4 : /cgn2_6/ptodata/2/iaa/6B_COMB.pep: * 

5: /cgn2_6/ptodata/2/iaa/PCTUS_COMB.pep:+ 

6 : /cgn2_6/ptodata/2/iaa/backf ilesl . pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 

PCT-US92-00282-3 

; Sequence 3, Application PC/TUS9200282 
; GENER7VL, INFORMATION: 

APPLICANT: OWENS, IDA S. 
APPLICANT: RITTER, JOSEPH K. 
; TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
; TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
; STREET: 1615 L STREET, N.W. 

CITY: WASHINGTON 
STATE : D . C . 
; COUNTRY: U.S.A. 

ZIP: 20036-5601 
COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 



COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 00282 

FILING DATE: 19920110 
; CLASSIFICATION: 435 

ATTORNEY/AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

REGISTRATION NUMBER: 2 6581 

REFERENCE/DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 

TELEFAX: 202-822-0944 

TELEX: 6714 627 CUSH 
INFORMATION FOR SEQ ID NO: 3: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 533 amino acids 

TYPE: AMINO ACID 

STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US92-00282-3 

Query Match 24.3%; Score 674; DB 5; Length 533; 

Best Local Similarity 33.8%; Pred. No. 5.9e-63; 

Matches 175; Conservative 92; Mismatches 212; Indels 38; Gaps 13; 

Qy 8 LLVGFLL — PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I : : I I I I :: I I I I I I I I I I : I I III I I : : I I 

Db 11 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 62 

Qy 66 D FKKEEKS YQVI SWLAPEDHQRE- FKKS F DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : : : 

Db 63 DASLYIRDGAFYTLKTY — P VP FQRED VKE S FVS LGHNVFEN D S FLQRVI KTYKKIK 117 

Qy 112 NVLEYLTVLQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I I I I : I : : I I I : | | : : : : I I : : I : I II I 
Db 118 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 

Qy 172 SLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

I II I I I I I I I I : I II I I I I I I : I I : | : | : 

Db 178 SLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDWYSPY-ATLASE 236 

Qy 22 9 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 2 88 

I : : II I : I III I : I I : : I I I : I I I : I : I : I. : I 

Db 237 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 295 

Qy 28 9 KFGDSGFVTjWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKI 34 8 

I : I I : : I I I II : : : : I I I I : I : : : : I I I : 
Db 296 ASGEHGIWFSLGSMVSEIPEKKAM-AIADALGKNPQTVLWRYTGT RPSNLANNTIL 351 

Qy 349 VDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGV 408 

llllhlllll I I : I I I : : I : I : I I I I I : I I I I I I : I hi I I I 

Db 352 VKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETKGAGV 411 



QY 



409 SIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGA 468 



:::::: I I : I : : I I I I : I : : I : I | | : : I : : II 
Db 412 TLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGA 471 

Qy 469 THLKPYVFQQPWHEQYLFDVEVFLLGLTLGTLWLCGK 505 

I I : I I :: : II I I I : I :: I 

Db 472 PHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 508 



RESULT 2 
US-09-356-806-8 

; Sequence 8, Application US/09356806 

; Patent No. 6586175 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

; APPLICANT: Galvin, Margaret 

; APPLICANT: Miller, Andrew 

; APPLICANT: Reidy, Michael 

; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltrans f erase 2B4 (UGT2B4 ) , 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 
; FILE REFERENCE: SEQ-22PRV2 

; CURRENT APPLICATION NUMBER: US/09/356, 806 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ ID NOS : 164 

; SOFTWARE: FastSEQ for Windows Version 3.0 

; SEQ ID NO 8 

; LENGTH: 528 

; TYPE: PRT 

; ORGANISM: H. sapiens 

US-09-356-806-8 



Query Match 23.9%; Score 661.5; DB 4; Length 528; 

Best Local Similarity 34.2%; Pred. No. 1.3e-61; 

Matches 17 3; Conservative 87; Mismatches 215; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVISWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 34 SHWMNIKTILDELVQRGHEVTVLASSASISFDPNSPSTLKFEVYPVSLTKTEFEDIIKQL 93 

Qy 90 KKS FDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145 

I : : : I I : : : : I I : : I : I I : I I : I : : 

Db 94 VKRWAELPKDTFW — SYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQESRFDWLAD 151 

Qy 146 TFDYCPF — LIAEKLGKPFVAILSTSFG-SLE FGLPI PLS YVPVFRSLLTDHMDFWG 199 

II 1:111 III I I I : : I II I I I I I I I hi I I 

Db 152 A--VFPFGELLAELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 209 



Qy 200 RVKN— FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 257 

I I I I : : : : I I : I : : I I I II : I I : : I I : : I I 

Db 210 RVKNMI YVLYFEFWFQI FDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLI RNYWDFQF 266 

Qy 258 Tu^PLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEIFKEM 316 

Mill : I I I I I I I I : I : : : I I : I : : I I : = I I I I I I I : I 

Db 267 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE--ERANVI 324 



Qy 



317 NNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIM 37 6 



• I i ■ i i i • i • ; i i i i • i i • i m ii i i • i j i i i i 

Db 325 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWI PQNDLLGHPKTRAFITHGGANGI Y 381 

Qy 377 EAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIM 436 

I I I I I : I I I I : I I I I I I : I : : : I I I I : : : I : I : : I I I 
Db 382 EAIYHGIPMVGVPLFADQPDNIAHMKAKGAAVSLDFHTMSSTDLLNALKTVINDPLYKEN 441 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 4 96 

I : II I : I I I I I : I : : I I I I : I : : II III 

Db 442 AMKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVA7UIDLTWFQYHSLDVTGFLLACV 501 

Qy 4 97 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : I I I I : : I : I I 
Db 502 ATVIFIITKCL-FCVWKFVRTGKKGK 526 



RESULT 3 

PCT-US92-00282-6 

; Sequence 6, Application PC/TUS9200282 
; GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 
APPLICANT: RITTER, JOSEPH K. 
; TITLE OF INVENTION: THE GENETIC LOCUS UG.Tl AND A MUTATION 

TITLE OF INVENTION: THEREIN. 
; NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
STREET: 1615 L STREET, N.W. 
CITY: WASHINGTON 
STATE: D.C. 
COUNTRY: U.S.A. 
ZIP: 20036-5601 
; COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 00282 
; FILING DATE: 19920110 

CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 

NAME: SCOTT, WATSON T. 
; REGISTRATION NUMBER: 26581 

; REFERENCE/ DOCKET NUMBER: 91532-PCT 

TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 

TELEFAX: 202-822-0944 

TELEX: 6714 627 CUSH 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 531 amino acids 

; TYPE: AMINO ACID 

; STRANDEDNESS: single 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US92-00282-6 



Query Match 23.8%; Score 658; DB 5; Length 531; 

Best Local Similarity 30.9%; Pred. No. 3e-61; 

Matches 171; Conservative 106; Mismatches 211; Indels 66; Gaps 17; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: I Ml: II : I I : I : : I I I : I I I : I II -I 

Db 7 LRGLSGLLLLLCALP WAEGGKVL-VFPMEGSHWLSMRDWRELHARGHQAWL 58 

Qy 61 GPFMPDFKKEEKSYQVI SWLAP EDHQREF KKSFD FFLEETLGGRGK 106 

I : | | : : : : | I : : I I I III: I I I I : I 

Db 59 APEVTVHMKGEDFFTLQTYAFPYTKEEYQREILGNAKKGFEPQHFVKTFF — ETMASIKK 116 

Qy 107 FENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAIL 166 

I : I | | : | : | : : | : : | | : I : : I I : I : I I I I 

Db 117 FFDL YANSCAALLHNKTLIQQLNSSSFDWLTDPVFPCGT^LAKYLQI PAVFFL 170 

Qy 167 STSFGSLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQST 22 0 

: : : : I I I I : I : : I : I I I I I I I I I : I hi 

Db 171 RSVPCGIDYEATQCPKPSSYIPNLLTMLSDHMTFLQRVKNMLYPLTLKYIC HLSIT 226 

Qy 221 FDNTIKEHFTEGSR PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIK 277 

: : : I I I I I : I II I h I h : I I h : I h I 

Db 22 7 PYESLASELLQREMSLVEVLSH ASVWLFRGDFVFDYPRP IMPNMVFI GGINCVI KK 282 

Qy 278 PVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWP 337 

I : I : I : : I : I h : I I I I h : h I : I I : : I : : 

Db 283 PLSQEFEAYVNASGEHGIWFSLGSMVSEI PEKKAM-EIAEALGRI PQTLLWRYTGT 338 

Qy 338 KDVHLAANVK I VDWL PQ S DLLAH P S I RL FVTHGGQN S IMEAI QHGVPMVG I P L FGDQ PEN 397 

: : I I I : I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I I I I : I 

Db 339 RPSNLAKNTILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMVMMPLFGDQMDN 398 

Qy 398 MVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVG 457 

I : I : | | : : : : : | : I : I : : : I I I : I : : hi I 
Db 399 7VKRMETRGAGVTLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVF 458 

Qy 458 WIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGM 509 

|:::h: II I hi | : : : | | | | | : | : : : III 

Db 459 WVEYVMRHKGAPHLRPAAHDLTWYQYH S LDVI GFLLAI VLTWFI VYKS CAYGCRKC FG- 517 

Qy 510 AVWWLRGARKVKET 523 

I : I h : 

Db 518 GKGRVKKS 525 



RESULT 4 
US-09-180-852-2 

; Sequence 2, Application US/09180852 

; Patent No. 6287834 

; GENERAL INFORMATION : 

; APPLICANT: BELANGER, Alain 

; APPLICANT: HUM, Dean W. 

; APPLICANT: BEAULIEU, Martin 

; APPLICANT: LEVESQUE, Eric 

; TITLE OF INVENTION: CHARACTERIZATION AND USE OF AN ISOLATED URIDINE 
; TITLE OF INVENTION: DIPHOSPHO-GLUCURONOSYLTRANSFERASE 



; FILE REFERENCE: 1259-449 

; CURRENT APPLICATION NUMBER: US/09/180,852 

; CURRENT FILING DATE: 1999-02-08 

; EARLIER APPLICATION NUMBER: PCT/CA97/00328 

; EARLIER FILING DATE: 1997-05-16 

; EARLIER APPLICATION NUMBER: US 08/649,319 

; EARLIER FILING DATE: 1996-05-17 

; NUMBER OF SEQ ID NOS : 2 

; SOFTWARE: Patentln Ver. 2.0 

; SEQ ID NO 2 

LENGTH: 530 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-180-852-2 



Query Match 23.7%; Score 657; DB 3; Length 530; 

Best Local Similarity 32.6%; Pred. No. 3.9e-61; 

Matches 159; Conservative 92; Mismatches 211; Indels 26; Gaps 11; 



Qy 


34 


SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVI-SWLAPEDHQREFKKS 

1 1 : : 1 : : 1 1 1 1 : 1 : 1 : 1 : 1 1=11 
S HW I NMKT I LE E L VQ RGH EVI VLT S S AS I L WAS K S S AI KL E VY P T S LT KN D L E D F FMKM 


92 


Db 


34 


93 


Qy 


93 


FDFFLEETLGGRGKFENLLNVLEYLALQCSHF LNRKDIMDSLKNENFDMVI 

II: : | : : | : | : | : I I : I : I I : 1 1 :: : 
FDRWT YS I — S KNT FWS YFSQLQELCWE YS D YNI KLCEDAVLNKK- LMRKLQES KFDVLL 


143 


Db 


94 


150 


Qy 


144 


VETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPIPLSYVPVFRSLLTDHMDFWG 

: : 1 1 : 1 1 1 1 1 : 1 1 1 : : 1 1 1 Mill 1 1 : 1 1 1 
ADAVNPCGELLAELLNI PFLYSLRFSVGYTVEKNGGGFLFPPS YVPWMSELSDQMI FME 


199 


Db 


151 


Z ID 


Qy 


200 


RVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 
1 : 1 1 : : : : 1 1 : : : : 1 1 1 1 : | | | : | | : : I : 1 
RIKNMIYMLYFDFWFQAYDLKK-WDQFYSE — VLGRPTTLFETMGKAEMWLIRTYWDFEF 


257 


Db 


211 


267 


Qy 


258 


ARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMN 

1 1 1 1 1 : 1 1 1 1 I I I I : I : : : 1 1 : 1 : : 1 1 : : 1 1 1 1 : : : 1 : 

PRPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGIWFSLGSMISN-MSEESANMIA 


317 


Db 


268 


326 


Qy 


318 


NAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVTWLPQSDLLAHPSIRLFVTHGGQNSIME 

: I I : 1 1 1 : 1 : 1 1 j 1 : : 1 1 1 1 : 1 1 1 II : 1 : 1 1 1 1 1 1 1 

SALAQI PQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYE 


377 


Db 


327 


383 


Qy 


378 


AIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAA 
1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 : 1 : : : 1 1 : | : : : : : 1 : 1 : : 1 II 
AI YHGI PMVGI PLFADQHDN I AHMKAKGAALS VDI RTMS S RDLLNALKSVINDP I YKEN I 


437 


Db 


384 


443 


Qy 


438 


VAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTL 

: II 1 : 1 1 1 1 1 : 1 : : MM: 1 = : M Ml 
MKLS RI HHDQPVKPLDRAVFWI E FVMRHKGAKHLRVAAHNLTWI QYHS LDVI AFLLACVA 


497 


Db 


444 


503 


Qy 


498 


GTLWLCGK 505 




Db 


504 


: : : 1 

TMIFMITK 511 





RESULT 5 



US-09-356-806-40 

; Sequence 40, Application US/09356806 

; Patent No. 6586175 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

; APPLICANT: Galvin, Margaret 

; APPLICANT: Miller, Andrew 

; APPLICANT: Reidy, Michael 

; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltrans f erase 2B4 (UGT2B4), 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 

FILE REFERENCE: SEQ-22PRV2 
; CURRENT APPLICATION NUMBER: US/09/356,806 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ ID NOS : 164 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 40 
; LENGTH: 524 
TYPE: PRT 

ORGANISM: H. sapiens 
US-09-356-806-40 

Query Match 23.5%; Score 651.5; DB 4; Length 524; 

Best Local Similarity 31.5%; Pred. No. 1.5e-60; 

Matches 163; Conservative 87; Mismatches 212; Indels 55; Gaps 14; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG-PFMPD FKKEEKSY Q 75 

II:::: I I I I I : I I I : I I : : : I 

Db 34 SHWMNIKTILDELIQRGHEVTVLASSASILFDPNNSSALKIEIYPTSLTKTELENFIMQQ 93 

Qy 76 VISWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDS 133 

: | | : | : | : : ::::: ::( ::|j 

Db 94 IKRWSDLPKD TFWLYFSQV QEIMSIFGDITRKFCKDWSNKKFMKK 139 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTS FGSLEFGLPI PLS YVPVFRS 189 

: : II::: I I : I I I I I I I I I I I I I I I I I 

Db 140 VQESRFDVIFAJ3AIFPCSELL7VELFNIPFVTSLSFSPGYTFEKHSGGFIFPPSWPVVMS 199 

Qy 190 LLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTE GSRPVLSHLLLKA 244 

I I I I I I I I I : : : : I I II : I I I I : I I 

Db 2 00 ELTDQMTFMERVKNMIYVLYFDF WFEIFDMKKWDQFYSEVLGRPTTLSETMGKA 253 

Qy 245 ELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV 304 

: : I I : : I I I I I I I Mill I I I I : I : : : I : I : | : : | I : : I I I I I 
Db 2 54 DVWLIRNSWNFQFPYPLLPNVDFVGGLHCKPAKPLPKEMEDFVQSSGENGVWFSLGSMV 313 

Qy 305 NTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIR 364 

: | : : | | : I I I : I : : I I I : : I : I I : I I I I I I 
Db 314 SN-MTEERANVIASALAQI PQKVLWRFDGN KPDTLGLNTRLYKWI PQNDLLGHPKTR 369 

Qy 365 LFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKM 424 

I : I I I I I I I I I I I : I I I I I I I I I I I : I : : : I : I : : : I : 

Db 370 AFITHGGANGIYEAIYHGIPMVGIPLFADQPDNIAHMKARG7\AVRVDFNTMSSTDLLN7VL 429 

Qy 425 KQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQY 484 

I : : : I I I : I I I : I I I I I : I : : I I I I : I : : 



Db 430 KRVINDPSYKENVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYH 48 9 

Qy 48 5 LFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVK 521 

I I I I I : : : I I : I I I I 

Db 490 SLDVIGFLLVCVATVIFIVTKCCLFCFW — KFARKAK 524 



RESULT 6 

US-09-356-806-113 

; Sequence 113, Application US/09356806 

; Patent No. 6586175 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

; APPLICANT: Galvin, Margaret 

; APPLICANT: Miller, Andrew 

; APPLICANT: Reidy, Michael 

; TITLE OF INVENTION: Genotyping Human 

; TITLE OF INVENTION: UDP-Glucuronosyltrans f erase 2B4 (UGT2B4), 2B7 (UGT2B7) 
and 

; TITLE OF INVENTION: 2B15 (UGT2B15) Genes 
; FILE REFERENCE: SEQ-22PRV2 

; CURRENT APPLICATION NUMBER: US/09/356, 8 0 6 
; CURRENT FILING DATE: 1999-07-20 
; NUMBER OF SEQ ID NOS: 164 

; SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 113 

LENGTH: 530 

TYPE: PRT 

ORGANISM: H. sapiens 
US-09-356-806-113 

Query Match 23.4%; Score 649; DB 4.; Length 530; 

Best Local Similarity 31.8%; Pred. No. 2.8e-60; 

Matches 155; Conservative 90; Mismatches 218; Indels 24; Gaps 10; 

Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRGP FMPDFKKEEKS YQVI - SWLAPEDHQREFKKS 92 

I I : : I : : I I I I I : I : I : I : I I : I 

Db 34 SHWINMKTILEELVQRGHEVTVLTSSASTLVNASKSSAIKLEVYPTSLTKNDLEDSLLKI 93 

Qy 93 FDFFLEETLGGRGKFENLLNVLEYLALQ CSHFLNRKDIMDSLKNENFDMVIV 144 

I : : : | : : | : | : I : I : I I : I I : : : 

Db 94 LDRWIYGV— SKNTFWSYFSQLQELCWEYYDYSNKLCKDAVLNKKLMMKLQESKFDVILA 151 

Qy 145 ETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPIPLSYVPVFRSLLTDHMDFWGR 200 

: : I 1:11 I I : I MM I I I I I I I I Ml I I I 

Db 152 DALNPCGELLAELFNIPFLYSLRFSVGYTFEKNGGGFLFPPSYVPWMSELSDQMIFMER 211 

Qy 201 VKNF— LMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

: I I : : : I I : : : : I I I I : M M I I : : M I 

Db 212 IKNMIHMLYFDFWFQIYDLKK-WDQFYSE--VLGRPTTLFETMGKAEMWLIRTYWDFEFP 268 

Qy 259 RPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 318 

M Ml : M I I II I |: M : : I M I : : I I : : II M : : : I : : 

Db 269 RPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGIWFSLGSMISN-MSEESANMIAS 327 

Qy 319 AFAHLPQGVIWKCQCSHWPKDV1ILAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEA 37 8 

I | : M MM I I : I : : I M M I I I I I : M I I I I I I I I 



Db 



328 ALAQI PQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEA 384 



Qy 



379 IQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAV 438 



Db 




Qy 



Db 



439 AASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLG 498 

II I : I I I I I : I : : I I I I : I : : II I I I 

445 KLSRIHHDQPMKPLDRAVFWIEFVMRHKGAKHLRVAMNLTWIQYHSLDVIAFLLACVAT 504 



Qy 



4 99 TLWLCGK 505 



Db 



505 VIFIITK 511 



RESULT 7 

PCT-US92-00282-5 

; Sequence 5, Application PC/TUS92002 82 
; GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 

APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
.TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES : 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 
STREET: 1615 L STREET, N.W. 
CITY: WASHINGTON 
; STATE: D.C. 

; COUNTRY: U.S.A. 

; ZIP: 20036-5601 

; COMPUTER READABLE FORM: 

; MEDIUM TYPE: Floppy disk 

; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/00282 
FILING DATE: 19920110 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION : 
NAME : SCOTT, WATSON T. 
REGISTRATION NUMBER: 26581 
REFERENCE/ DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-861-3000 
TELEFAX: 202-822-0944 
; TELEX: 6714627 CUSH 

; INFORMATION FOR SEQ ID NO: 5: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 531 amino acids 

; TYPE: AMINO ACID 

STRANDEDNESS: single 
; TOPOLOGY: linear 

; MOLECULE TYPE: protein 
PCT-US92-00282-5 



Query Match 22.7%; Score 629; DB 5; Length 531; 

Best Local Similarity 30.4%; Pred. No. 3.8e-58; 

Matches 159; Conservative 100; Mismatches 218; Indels 46; Gaps 13; 



Qy 


4 


QRVLLLVGFL-LPGVLLSEAAKI LTI STVGGSHYLLMDRVSQI LQDHGHN VTMLNHKRGP 
II: Mill:::: | : | : 1 1 1 : 1 1 : : : 1 I 1 1 : : : 
QRI SAGVFFLALWGMWGD — KLLWPQ-DGSHWLSMKDIVEVLSDRGHEIVW 


62 


Db 


9 


59 


Qy 


63 


FMPDFK KEEKS YQVI SWLAPEDHQREFKKS FDFFLEETLGGRG KFENLLN 

: 1 : 1 1 1 1 : 1 1 1 1 1 : 1 1 : : 1 : 
-VPEVNLLLKEYKYYTRKI YPVP YD-QEELKNRYQS FGNNHFAERS FLTAPQTEYRNNMI 


112 


Db 


60 


117 


Qy 


113 


VLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIMKLGKPFVAILSTSFGS 

1 : : 1 | : | :: | | I : : 1 :: 1 1 1 1 1 1 : 1 

VIGLYFINCQSLLQDRDTLNFFKESKFDALFTDPALPCGVILAEYLGLPSVYLFRGFPCS 


172 


Db 


118 


177 


Qy 


173 


LEFGL PI PLS YVPVFRSLLTDHMDFWGRVKNFL MFFSFCRRQQHMQSTFD 

II 1 1 : 1 1 : 1 : : 1 1 1 1 1 1 1 1 1 : 1 : : : : 1 
LEHTFSRSPDPVSYIPRCYTKFSDHMTFSQRVANFLVNLLEPYLFYCLFSKYEKLASA — 


222 


Db 


178 


235 


Qy 


223 


NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 

: 1 II 1 : : 1 : II : : 1 1 : : 1 1 1 : : 1 1 : 1 1 : 1 : 
-VLK — — RDVDIITLSEVSVWLLRYDFVLEYPRPVMPNMVFIGGINCKKRKDLSQE 


282 


Db 


236 


287 


Qy 


283 


LENFIAKFGDSGFVLVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 


342 


Db 


288 


1 : 1 1 : 1 1 : : 1 1 1 1 1 : : : : 1 1 1 1 : 1 : : : : 1 

FEAYINASGEHGIWFSLGSMVSEIPEKKAM-AIADALGKNPQTVLWRYTGT RPSNL 


343 


Qy 


343 


AANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 

1 1 .1 llll.lll II 1 l.ll 1 . • 1*1 *lllll . 1 [ 1 II 1 * 1 1*1 


402 


Db 


344 


1 ! : 1 II 1 1 : 1 1 1 II 1 l-ll 1 : • 1*1 ♦Mill • II 1 II 1 -1 1*1 

T^NTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRME 


403 


Qy 


403 


AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 
1 | | :::::: I I : I : : I I I I : 1 : : 1 : 1 1 1 : : 1 
TKGAGVTLWLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFV 


462 


Db 


404 


463 


Qy 


463 


LQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 
: : 1 1 1 1 : 1 1 : : : II 1 1 1 : 1 : : 1 
MRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 506 




Db 


464 





RESULT 8 

PCT-US92-00282-4 

; Sequence 4, Application PC/TUS9200282 
; GENERAL INFORMATION: 

APPLICANT: OWENS, IDA S. 

APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
; TITLE OF INVENTION: THEREIN. 
; NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: CUSHMAN DARBY & CUSHMAN 

STREET: 1615 L STREET, N.W. 

CITY: WASHINGTON 
; STATE: D.C. 

COUNTRY: U.S.A. 

ZIP: 20036-5601 



; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/00282 

FILING DATE: 19920110 
; CLASSIFICATION: 435 

; ATTORNEY/AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

REGISTRATION NUMBER: 2 6581 

REFERENCE/ DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: 202-861-3000 

TELEFAX: 202-822-0944 

TELEX: 6714 627 CUSH 
INFORMATION FOR SEQ ID NO: 4: 
; SEQUENCE CHARACTERISTICS: 

; LENGTH: 534 amino acids 

; TYPE: AMINO ACID 

STRANDEDNESS : single 
; TOPOLOGY: linear 

MOLECULE TYPE: protein 
PCT-US92-00282-4 

Query Match 21.9%; Score 606; DB 5; Length 534; 

Best Local Similarity 31.3%; Pred. No. l.le-55; 

Matches 163; Conservative 94; Mismatches 218; Indels 46; Gaps 13; 

Qy 8 LLVGFLLPGVLLS EAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGP 62 

I III III | : | : | : | | | : | | : I II : I I 
Db 12 LATGLLL LLSVQPWAESGKVLWPT-DGSPWLSMREALRELHARGHQAWLT P 63 

Qy 63 FMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLG GRGKFENLL 111 

: | | | | : : : : | I : I I : III I : I : I I 

Db 64 EVNMHIKEEKFFTLTAYAVPWT-QKEFDR VTLGYTQGFFETEHLLKRYSRSMA 115 

Qy 112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILST 168 

II I I I : : : : I : I I : I : : : I : : I : I II 
Db 116 IMNNVSLALHRCCVELLHNE^IRHLNATSFDWLTDPWLCGAVTAKYLSIPAVFFWRY 175 

Qy 169 SFGSLEF GLPI PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTI 225 

1:1 lllhl :: I I I I I I I I I : : : : 

Db 176 IPCDLDFKGTQCPNPSSYIPKLLTTNSDHMTFLQRVKNMLYPLALSYICHTFSAPYASLA 235 

Qy 226 KEHFTEGSRPV-LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLE 284 

II ||: I : I : I II I : I I : : I I I : : I I : I I : I : I 

Db 236 SELF QREVSWDLVSYASVWLFRGDFVMDYPRPIMPNMVFIGGINCANGKPLSQEFE 292 

Qy 285 NFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAA 344 

: I I : I I : : I I I I I : : : : I : I I I : I : : : : I I 

Db 293 AYINASGEHGI WFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSNLAN 348 

Qy 34 5 NVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAK 4 04 

I : I I I I I : I I I I I | | : I I I : : I : I : I I I I I : I I I I I I : I hi I 
Db 349 NTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRMETK 4 08 



Qy 



405 KFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQ 464 



Db 



ii ii * i * • ii • i • i • i 

409 GAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDIAVFWVEFVMR 468 



Qy 



4 65 TGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 



Db 



it I i • i i ■ • ii i i i • i • • i 

469 HKGAPHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 509 



RESULT 9 

PCT-US92-00282-7 

; Sequence 7, Application PC/TUS9200282 

; GENERAL INFORMATION: 

; APPLICANT: OWENS, IDA S. 

APPLICANT: RITTER, JOSEPH K. 

TITLE OF INVENTION: THE GENETIC LOCUS UGT1 AND A MUTATION 
TITLE OF INVENTION: THEREIN. 
NUMBER OF SEQUENCES: 40 
CORRESPONDENCE ADDRESS: 
; ADDRESSEE: CUSHMAN DARBY & CUSHMAN 

STREET: 1615 L STREET, N.W. 
CITY: WASHINGTON 
STATE: D.C. 
COUNTRY: U.S.A. 
; ZIP: 20036-5601 

; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.25 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US92/ 002 82 
FILING DATE: 19920110 
CLASSIFICATION: 435 
ATTORNEY/AGENT INFORMATION: 
; NAME: SCOTT, WATSON T. 

; REGISTRATION NUMBER: 2 6581 

REFERENCE/ DOCKET NUMBER: 91532-PCT 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 202-861-3000 
TELEFAX: 202-822-0944 
; TELEX: 6714627 CUSH 

INFORMATION FOR SEQ ID NO: 7: 
SEQUENCE CHARACTERISTICS : 
LENGTH: 529 amino acids 
TYPE: AMINO ACID 
; STRANDEDNESS: single 

; TOPOLOGY: linear 

MOLECULE TYPE: protein 
PCT-US92-00282-7 

Query Match 21.7%; Score 600; DB 5; Length 529; 

Best Local Similarity 28.3%; Pred. No. 4.8e-55; 

Matches 155; Conservative 105; Mismatches 222; Indels 66; Gaps 14; 



Qy 



8 LLVGFL LPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFM 64 



I ill I I : I : 1:1 : I I hi I : : I : I I : : :*l : 
Db 10 LPAGFLFLVLWGSVLGD--KLLWPQ-DGSHWLSMKEIVEHLSERGHDIWL V 59 

Qy 65 PDFK KEEKS YQVISWLAP EDHQREFKKSFDFFLEETLGGRGKFENLLNVLE 115 

I : I I I : I : I I I : : : : I : I : : 

Db 60 PEVNLLLGESKYYRRKSFPVPYNLEELRTRYRSFGNNHFAASSPLMAPLREYRNNMIVID 119 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEF 175 

II : | : | | : : I : : I I I I : : II I 

Db 120 MCFFSCQSLLKDSATLSFLRENQFDALFTDPAMPCGVILAEYLKLPSIYLFRGFPCSLEH 179 

Qy 176 — GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGS 233 

I I : I I I I I : : I I I I I : I I : : : I : I 

Db 180 IGQSPSPVSYVPRFYTKFSDHMTFPQRLANFI ANILENYL-YHCLYSK 226 

Qy 234 RPVLSHLLLKAE LWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDL 283 

: I : III: I I : I I I : : I I : : I I : : : I I I : I : 

Db 227 YEILASDLLKRDVSLPALHQNSLWLLRYDFVFEYPRPVMPNMIFIGGTNCKKKGNLSQEF 286 

Qy 284 ENFIAKFGDSGFVLWLGSMVNTCQNPEIFKEmNAFAHLPQGVIWKCQCSHWPKDVHLA 343 

I : : I : I I : : I I I I I : : hi : I I : : h : : : I I 

Db 287 EAYVNASGEHGIWFSLGSMVSEI PEKKAM-EIAEALGRI PQTLLWRYTGT RPSNLA 342 

Qy 344 ANVKIVDWLPQSDLI^PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEA 403 

I : I I I I 1 : I I I i I I I : I I I : I I I : I I I I I : I I I I I I : I hi 

Db 343 KNTILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMVMMPLFGDQMDNAKRMET 402 

Qy 404 KKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVL 463 

: | | : : : : : | : | : I : : : I I I : I : : hi I h : : h 

Db 403 RGAGWLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDIAVFWEYVM 462 

Qy 464 QTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGMAVWWLR 515 

: lllhl h : : II I I I : I : : : I I I 
Db 463 RHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVYKSCAYGCRKCFG 515 

Qy 516 GARKVKET 523 

I : I h : 
Db 516 GKGRVKKS 523 



RESULT 10 
US-09-813-918-2 

; Sequence 2, Application US/09813918 

; Patent No. 6383789 

; GENERAL INFORMATION: 

; APPLICANT: WEBSTER, Marion et al . 

; TITLE OF INVENTION: ISOLATED HUMAN DRUG-METABOLIZING 

; TITLE OF INVENTION: PROTEINS, NUCLEIC ACID MOLECULES ENCODING HUMAN 
; TITLE OF INVENTION: DRUG-METABOLIZING PROTEINS, 
; TITLE OF INVENTION: AND USES THEREOF 
; FILE REFERENCE: CL001175 

; CURRENT APPLICATION NUMBER: US/09/813,918 
; CURRENT FILING DATE: 2001-03-22 
; NUMBER OF SEQ ID NOS : 4 

SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID ,NO 2 
; LENGTH: 4 54 



TYPE: PRT 
ORGANISM: Human 
US-09-813-918-2 



Query Match 18.8%; Score 521; DB 4; Length 454; 

Best Local Similarity 28.7%; Pred. No. le-46; 

Matches 128; Conservative 79; Mismatches 173; Indels 66; Gaps 13; 

Qy 118 ALQCSHFLNRKDIMDSLKNENFDMVIVET FDYCPFLIAEKLGKPFVAILSTSFGS 172 

I : I I : : I I I : I : : : : : II = 
Db 30 AAE YSHWMNMKT I LKELVQRGHEVTVLAS SAS I LFD PNDAST 71 

Qy 173 LEFGLPIPLSYVPV-FRSLLTDHMDFWGRVK--NFLMFFS : FCR — 212 

| : | : I I | : : : : I : : : I : : I I 11 = 

Db 72 LKFEV-YPTSLTKTEFENIIMQQVKRWSDIRKDSFWLYFSQEQEILWELYDIFRNFCKDV 130 

Qy 213 RQQHMQST FDNT I KEH- FTEGS RP-VLSHLLLKAELWFINSDFAFD 256 

: : : I I : I I III : I I : : I : : : : I 

Db 131 VSNKKWIKKLQELRFDIVFADAVFPCGELLAALLNIRPTTLFETMGKADIWLMRNPWSFQ 190 

Qy 257 FARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEM 316 

I | I I I : I I I | | | | : I : : : I I : | : : | | : : I I I : : : I : 

Db -191 FPHPFLPNVDFVGGFHCKPAKPLPKEMEEFVQSSGENGVWFSLGSVISN-MTAERANVI 249 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIM 376 

| | : | | | : | : : I I I : : I : I I : I I I I I I I : I I I I I I 
Db 250 ATALARI PQKVLWRFDGN KPDALGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 306 

Qy 377 EAIQHGVPMVGIPLFGDQPE>mWVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSA 436 

I I I II : I I I I II I I I I I : I : : : I I I : : : I : I : : I I I 

Db 307 EAIYHGIPMVGIPLFFDQPDNIAHMKAKGAAVRLDFNTMSSTDLLNALKTVINDPLYKEN 366 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 496 

: II I : I I I I I : I : I I I I : I : : I I III 

Db 367 IMKLSRIQHDQPVKPLDRAVFWIEFVMPHKGAKHLRVAAHDLTWFQYHSLDVIGFLLACV 426 

Qy 497 LGTLWLCGKLLGMAVWWLRGARKVKE 522 

: : : I I : I I I I : 

Db 427 ATVI FI I T KFCLFCFW — KFARKGKK 450 



RESULT 11 
US-09-813-918-3 

; Sequence 3, Application US/09813918 

; Patent No. 6383789 

; GENERAL INFORMATION: 

; APPLICANT: WEBSTER, Marion et al. 

; TITLE OF INVENTION: ISOLATED HUMAN DRUG-METABOLIZING 

; TITLE OF INVENTION: PROTEINS, NUCLEIC ACID MOLECULES ENCODING HUMAN 

TITLE OF INVENTION: DRUG-METABOLIZING PROTEINS, 
; TITLE OF INVENTION: AND USES THEREOF 
; FILE REFERENCE: CL001175 

; CURRENT APPLICATION NUMBER: US/09/813, 918 
; CURRENT FILING DATE: 2001-03-22 
; NUMBER OF SEQ ID NOS : 4 

; SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 3 



LENGTH: 28 8 
TYPE: PRT 
ORGANISM: Human 
US-09-813-918-3 

Query Match 18.2%; Score 503.5; DB 4; Length 288; 

Best Local Similarity 35.9%; Pred. No. 3.7e-45; 

Matches 104; Conservative 53; Mismatches 126; Indels 7; Gaps 4; 

Qy 234 RP-VLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGD 292 

II I : ||::| : : ::| | I Ml :||| II ||:|:::| |: |: 
Db 1 RPTTLFETMGKADIWLMRNSWSFQFPHPFLPNVDFVGGFHCKPAKPLPKEMEEFVQSSGE 60 

Qy 293 SGFVTjWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPK^ 352 

: I I : : I I I : : : I : I I : I I I : I : : I I I : : I : 
Db 61 NGVWFS LGS VI SN-MTAERANVI ATALAKI PQKVLWRFDGN KPDALGLNTRLYKWI 116 

Qy 353 PQSDLIJUiPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQL 412 

11:11111 I I : I I I I I I I I I I I : I I I I II I I I I I : I : :: I I I : 

Db 117 PQNDLLGHPKTRAFITHGGANGIYEAIYHGIPMVGIPLFFDQPDNIAHMKAKGAAVRLDF 176 

Qy 413 KKLKTVETLALKMKQIMEDKRYKSTVAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLK 472 

: : I : I : : I I I : I I I : I I I I I : I : MM: 

Db 177 NTMSSTDLLNALKTVINDPLYKENIMKLSRIQHDQPVKPLDRAVFWIEFVMPHKGAKHLR 236 

Qy 473 PWFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKE 522 

I : : I I M I : : : I I : I M I : 

Db 237 VAAHDLTWFQYHSLDVI GFLLACVATVI FIITKFCLFCFW — KFARKGKK 284 



RESULT 12 
US-09-305-856B-18 

; Sequence 18, Application US/09305856B 

; Patent No. 6479236 

; GENERAL INFORMATION: 

; APPLICANT: Penny, Laura 

APPLICANT: Galvin, Margaret 
; TITLE OF INVENTION: Genotyping the Human 

; TITLE OF INVENTION: UDP-Glucuronosyltransf erase 1 (UGT1). Gene 

; FILE REFERENCE: 4389-7 (formerly SEQ-17CIP) 

; CURRENT APPLICATION NUMBER: US/09/305, 856B 

; CURRENT FILING DATE: 1999-05-05 

; PRIOR APPLICATION NUMBER: 60/084,807 

; PRIOR FILING DATE: 1998-05-07 

; NUMBER OF SEQ ID NOS : 124 

SOFTWARE: FastSEQ for Windows Version 3.0 
; SEQ ID NO 18 

LENGTH: 24 5 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-305-856B-18 

Query Match 13.7%; Score 380; DB 4; Length 245; 

Best Local Similarity 34.8%; Pred. No. 4.5e-32; 

Matches 78; Conservative 49; Mismatches 93; Indels 4; Gaps 2; 



Qy 2 82 DLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAF7VHLPQGVIWKCQCSHWPKDVH 341 



: I : I I : I I : : I I I I I : : : : I : I I I : I : : : : 

Db 1 EFEAYINASGEHGIWFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSN 56 

Qy 342 LAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRV 401 

III : I I I I I : I I I I I I I : I I I : : I : I : I I I I I : I I I I I I : I I : 
Db 57 LANNTILVKWLPQNDLLGHPMTRAFITHAGSHGVYESICNGVPMVMMPLFGDQMDNAKRM 116 

Qy 402 EAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDH 461 

II I I :::::: I I : I :: I I I I : I : : I : I I I : : 

Db 117 ETKGAGVTLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEF 176 

Qy 462 VLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I : : 1111:1 I : : : II I I I : I : : I 

Db 177 VMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAWLTVAFITFK 220 



RESULT 13 
US-08-942-012B-32 

Sequence 32, Application US/08942012B 
Patent No. 6235278 
GENERAL INFORMATION: 
APPLICANT: Miller, Lois K. 
APPLICANT: Lu, Albert 
APPLICANT: Dierks, Peter 
APPLICANT: Black, Bruce 

TITLE OF INVENTION: Biological Insect Control Agents Expressing 
TITLE OF INVENTION: Insect-Specific Toxin Genes, Methods and Compositions 
FILE REFERENCE: 28-96a 

CURRENT APPLICATION NUMBER: US/08/942 , 012B 
CURRENT FILING DATE: 1997-10-01 
PRIOR APPLICATION NUMBER: 08/729,606 
PRIOR FILING DATE: 2000-10-01 
NUMBER OF SEQ ID NOS: 33 
SOFTWARE: Patent In Ver . 2.0 
SEQ ID NO 32 
LENGTH: 515 
TYPE: PRT 

ORGANISM: Spodoptera littoralis nuclear polyhedrosis virus 
US-08-942-012B-32 

Query Match 12.5%; Score 346.5; DB 3; Length 515; 

Best Local Similarity 22.5%; Pred. No. 5.9e-28; 

Matches 122; Conservative 102; Mismatches 211; Indels 107; Gaps 18; 

Qy 2 0 S EAAKI LT I - STVGGSHYLLMDRVSQI LQDHGHNVTML NHKRGPFMPDFKKEEKS 73 

I I :: I : I I I : I I I I :: :: III 

Db 16 SAAVRVLCMFPTPSYSHQTVFDVYVNALLRRGHSLWISPKIHNHNHG 63 

Qy 74 YQVI SWLAPEDHQREFKKS FD FF LEETLGGRGK FENLLNV 113 

I : I I | | | : : : | : I I : 

Db 64 HRHHRHENLTEIDVGSVTNNFFKRLLQDSKVSRKRGIVSDSSTVTRVNYLGL 115 

Qy 114 LEYLALQCSHFLNRKDIMDSLK-NENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGS 172 

: : I I : : | : I : I I : : : : I I I : : : I : I : I : 

Db 116 ARMISAQFEH EQVKRLLRSNQTFDVIVIEAFVSYPLILSYFFKDT PVIQISSGHGT 171 



Qy 



173 LEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTE- 2 31 



I : : I I : : : | : : I : : : I I I 

Db 172 AE N FETMGAVARH PVY YPNMWRDRFKGL S VWQT VRQVFT E I 212 

Qy 232 GSR-PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVG 2 69 

I I : I : I : I : I : II I I : I hi 

Db 213 RLYMEFSQLDADQSAMMKRQFGSKVPDVDALRKNVHiyiMFVNTHPVFDTNRPVPSNVQYLG 272 

Qy 27 0 GL-MEKPIKPVPQDLENFIAKFGDS GFVLVTLGSMVNTCQNPEIFKEMNNAFAH 322 

I : : : : I : : : I : I : I : : I I h I I I I : : III 

Db 273 GIHIDPAVTSVADEIDNDLAEFLENSTMGWYVSLGSSVRA SDMDSNMLNVFVETFR 329 

Qy 323 -LPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLIAHPSIRLFVTHGGQNSIMEAIQH 381 

: I I : I I I | : : : | | | III : I I : : : : I : I I I I III 

Db 330 SIPYRVLWKVDKSDKIFD-NIPSNVLIQRWFPQRRVLKHRNVKVFITQGGVQSTDEAIDA 388 

Qy 382 GVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAAS 441 

I I I I I : I : I I I I : I I : : I I :::::: | | : : 

Db 38 9 GVPMFGVPIMGDQFYNVYMYETYGIGRGVDTLTVDARQLTEIVMDVADNEKYKNGTLWLR 44 8 

Qy 442 VI LRSHPLS PTQRLVGWI DHVLQTGGA-THLKP YVFQQPWHEQYLFDVFVFLLGLTLGTL 500 

: h I :: I : : I I : I I I I : : : | | : : : | : | : : 

Db 449 DAIMDQPMRPLEKAWYTEHVARRKGAKKHLGTRAANVTYSKYAMFDLILPML-ITIFST 507 

Qy 501 WL 502 

: I 

Db 508 YL 509 



RESULT 14 
US-08-942-012B-33 

Sequence 33, Application US/08942012B 
Patent No. 6235278 
GENERAL INFORMATION: 
APPLICANT: Miller, Lois K. 
APPLICANT: Lu, Albert 
APPLICANT: Dierks, Peter 
APPLICANT: Black, Bruce 

TITLE OF INVENTION: Biological Insect Control Agents Expressing 
TITLE OF INVENTION: Insect-Specific Toxin Genes, Methods and Compositions 
FILE REFERENCE: 28-96a 

CURRENT APPLICATION NUMBER: US/08/ 942 , 012B 
CURRENT FILING DATE: 1997-10-01 
PRIOR APPLICATION NUMBER: 08/729,606 
PRIOR FILING DATE: 2000-10-01 
NUMBER OF SEQ ID NOS : 33 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 33 
LENGTH: 460 
TYPE: PRT 

ORGANISM: Lacanobia oleracea granulovirus 
US-08-942-012B-33 

Query Match 12.4%; Score 342.5; DB 3; Length 460; 

Best Local Similarity 24.7%; Pred. No. 1.3e-27; 

Matches 120; Conservative 93; Mismatches 215; Indels 57; Gaps 18; 



Qy 



7 LLLVGFLLPGVLLSEAAKILTI-STVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 



: I I : : : I I I I : I II: I Mill:: I 

Db 5 I LLLALAVERI L CANI LCVFPT PAYSHQS VFSAYI DKL S WAGHNVT VI T P 54 

Qy 66 DFKKEEKSYQVI SWLAPEDHQREFK KSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 

: : : | | : | | : I I : I I : : I : I : : 
Db 55 MPRAVDHVHQWSSLSVHYFNNLIKNSTMIKKRGWADETTVTKENYMGLINLV 108 

Qy 121 CSHFLNRKDIMDSLKNE — NFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP 178 

: I : : : I : I : I I : : : I I : : : III I 

Db 109 -AHEIKSPNVTRLLRNKGNKFDLIVCEA— YVSYILV FGAIYDAPVIQFSSGYA 159 

Qy 179 I PLS YVPVFRSLLTDHM DFWG RVKNF — LMFFSFCRRQ-QHMQSTFDNTIKEHF 22 9 

I I : : I : : I : : I I I I I : : : : : : : I : I I 

Db 160 IPENFETVGGEVARNHIKHPNIWRSDFSKSNFEQLMTENYLKNEWALLEKEQENMLKRDF 219 

Qy 230 TEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGL-MEKPIKPVPQDLENFIA 288 

I : I : : I I I II I : I I : I I : : : I I I : I : 

Db 220 — GYHHDMCQLKSRVLMLFINVPAVFDNNRDVSNNIQYLGGIHLKKPRTVRDSRLLSFME 277 

Qy 2 89 KFGDSGFVLVTLGSMVNTC-QNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL A 343 

I | : | | : : : : I I : I I : I I I : I I : 

Db 278 K- - HH 1 1 VYAS FG SGI DVLNMDAN L I AE FVRVFN SIP YAVLWKVD S S IHLKHNIS 330 

Qy 344 ANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEA 403 

: I I I I I I : I I I I :: I : I I I I I I : I I I I : I I I : I I I hi 

Db 331 SNVHTQSWFPQRDVLNHPHIKVFITQGGVQSTDEAVNSGVPMIGIPIMGDQFYNVRRYTE 390 

Qy 4 04 KKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVL 463 

| : : : | : | | I : I : : : I I : : : : I : I : : : : : : I I 

Db 391 LGI GEKVNI LRLEEEGLDRKI KNLVHNKS YELNI KRLNLFI SDTPVKPLRKALWFTN YVL 450 

Qy 464 QTGGA 4 68 

I 

Db 451 RNKDA 455 



RESULT 15 
US-08-942-012B-24 

Sequence 24/ Application US/08942012B 
Patent No. 6235278 
GENERAL INFORMATION: 
APPLICANT: Miller, Lois K. 
APPLICANT: Lu, Albert 
APPLICANT: Dierks, Peter 
APPLICANT: Black, Bruce 

TITLE OF INVENTION: Biological Insect Control Agents Expressing 
TITLE OF INVENTION: Insect-Specific Toxin Genes, Methods and Compositions 
FILE REFERENCE: 28-96a 

CURRENT APPLICATION NUMBER: US/ 08/ 942 , 012B 
CURRENT FILING DATE: 1997-10-01 
PRIOR APPLICATION NUMBER: 08/729,606 
PRIOR FILING DATE: 2000-10-01 
NUMBER OF SEQ ID NOS : 33 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 2 4 
LENGTH: 515 
TYPE: PRT 



; ORGANISM: Helicoverpa zea nuclear polyhedrosis virus 
US-08-942-012B-24 



Query Match 11.6%; Score 321; DB 3; Length 515; 

Best Local Similarity 24.4%; Pred. No. 3.2e-25; 

Matches 129; Conservative 88; Mismatches 246; Indels 66; Gaps 21; 

Qy 6 VLLLVGFLLPGVLLSEAAKILTISTVGG-SHYLLMDRVSQILQDHGHNVTMLNHKRGPFM 64 

: I I I I I I I I : I : I I : I | | : : : : I I I : : II 
Db 8 MLLLVLFL — SVL — DGARILCVFPVPSYSHHAVFEAYTNALASRGHTIVRIT PF- 58 

Qy 65 PDFKKEEKS YQVI SWLAPEDH QREFKKSFDFFLEETLGGRGKFENLLNVLEYL 117 

II::: : I : I I I I II : I 

Db 59 PTKKNDSSNVTDVDVSLSKDYFKSLVDRSRLFKK RGVI SETS S VTARN 106 

Qy 118 ALQCSHFLNRKDIMDSLK NENFDMVIVETF-DYCPFLIAEKLGK-PFVAI 165 

: | | : : : | : : | | | : : : | | | | | : : I I : I 

Db 107 Y I S L VHML I DQ F S VE S VRQ LIES NN VFD L LVT EAF L D Y- P L VF S H L FGD VP VI Q I S S GHA 165 

Qy 166 LSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRV KN FLMFFS FCRRQQHMQST F 221 

|: :| :: |: I ::|: ::: I : ::: I I 
Db 166 1AENFETMGAVSRHPIYYPNLWRNKF-QNLNVWEIITEIYTELVLYLEFARLADE 219 

Qy 222 DNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPV — 279 

: I : I : I : : I I : I : I I I I : I : I : I I I 
Db 220 QTKMLRHQFGPNTPSVEELRQRVQLLFVNTHPLFDNNRPVPPSVQYLGSLHLDRNNDVDE 279 

Qy 280 PQDLENFIAKF GDSGFVLVTLGSMVNTC-QNPEIFKEMNNAFAHLPQGVIWKCQCSH 335 

I : : : : I : I I I : I : : : I I I I I I : : I I 

Db 280 QQTMDYNLMQFLNNSTNGWYVSFGTSIRVSDMDDEFLFEFITAFKQLPYNILWK — TDG 337 

Qy 336 WPKDVTiLAANVKIVX^WLPQSDLl^PSIRLFvTHGGQNSIMEAIQHGVPMVGIPLFGDQP 395 

I : I II I I I I : I I : : I I I I I I III I I : : I I I III 

Db 338 MPMEHVLPKNVLTQTWLPQHHVLKHSNWAFVTQGGMQSTDEAIDACVPLIGIPFMGDQA 397 

Qy 396 ENMVRVEAKKFGVS IQLKKLKAETL — ALKMKQIMEDKRYKSAAVAAS VI LRSHPLS PTQ 453 

I : I I : : I : I I : : II I : I : 

Db 398 YNTNKYEELGIGRNLDPVTLTSHILVSAVLDVTVNNKSRYTDNIKALNRSTNYRTRKPME 457 

Qy 454 RLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFV FLLGLTLG 4 98 

: : : : I I : I II : : I : I : I I I : II 

Db 458 KAIWYTEHVIDNGKNPILKTKAANVSYSKYYMSDIIVPVITFLVMTHLG 506 



Search completed: May 7, 2004, 17:33:56 
Job time : 23 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



May 1, 2004, 17:29:09 ; Search time 21 Seconds 

(without alignments) 
2395.627 Million cell updates/sec 

US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS GKLLGMAVWWLRGARKVKET 523 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



283366 



Database : 



PIRJ78:* 
pirl : * 
pir2:* 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
JC5423 

2-hydroxyacylsphingosine 1-beta-galactosyltransf erase (EC 2.4.1.45) - human 
N;Alternate names: UDP-galactose : ceramide galactosyltransf erase 
C; Species: Homo sapiens (man) 

C;Date: 10-Jun-1997 #sequence_revision 18-Jul-1997 #text_change 26-Aug-1999 
C;Accession: JC5423 
R;Kapitonov f D. ; Yu, R.K. 

Biochem. Biophys . Res. Commun. 232, 449-453, 1997 

A; Title: Cloning, characterization, and expression of human ceramide 
galactosyltransf erase cDNA. 

A; Reference number: JC5423; MUID : 97242209 ; PMID: 9125199 

A;Accession: JC5423 

A;Molecule type: mRNA 

A; Residues: 1-541 <KAP> 

A;Cross-references: GB:U62899 

A; Experimental source: fetal glioma cell 

C; Comment: This enzyme catalyzes the final step of galactosyl ceramide synthesis. 
C; Genetics : 
A; Gene: cgt 



C; Superf amily : glucuronosyltransf erase 

C; Keywords: glycoprotein; glycosyltransf erase ; hexosyltransf erase 
F;472-492/Domain: hydrophobic #status predicted <HYD> 

F;538-540/Region: endoplasmic reticulum retention signal #status atypical 
F;78, 333, 442/Binding site: carbohydrate (Asn) (covalent) ttstatus predicted 

Query Match 25.3%; Score 700; DB 2; Length 541; 

Best Local Similarity 32.2%; Pred. No. 6.5e-47; 

Matches 166; Conservative 95; Mismatches 185; Indels 70; Gaps 12; 

Qy 13 LLPGVLLSE^AKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 7 0 

I | : : : | | | | : : : I I : : : I : I I : I : I : I 
Db 11 LWSAVGIAKAAKI I IVPPIMFESHMYI FKTLASALHERGHHT VFLLSEGRD 61 

Qy 71 EKS YQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : I III:: II I : : I : : 

Db 62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAIELFDILDHYTK 112 

Qy 12 0 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : I I I I I : : : I : I I hi II : : : III 

Db 113 NCDMMVGNHALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

Qy 180 PL S YVP YFRS LLT.DHMD FWGRVKN FLMFFS FCRRQQHM QSTFD 222 

I I : I I I I I I I I I I : I : I I II: : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNLLQRMKNTGVYLISRLGVSFLVLPKYERIMQKYNLLPEKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLK7VELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 

Db 233 LVliGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

Qy 283 LENFI7VKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 

I : : : : I I I I I : I : I : : I : : I I I I I I I : I I : I 

Db 276 LQRWVNGANEHGFVLVSFGAGVKYL-SEDIANKLAGALGRLPQKVIWRFS GPKPKNL 331 

Qy 343 AANWIV13WLPQSDLIAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

I I : : : I I I I : I I I I I : I I : I I I I I I I : I I I I : I I I I : I I I : I I I : 
Db 332 GNNTKLIEWLPQNDLLGHSKIKAFVSHGGLNSIFETMYHGVPWGIPVFGDHYDTMTRVQ 391 

Qy 403 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 462 

| | | : : : I : : I : : : : : I : I I I : I I I : I I I : : 
Db 392 AKGMGILLEWKTVTEKELYEALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTIYWIDYI 451 

Qy 4 63 LQTGGATHLKPYVFQQPWHEQYLFDV-FVFLLGLTL 4 97 

: : II II: II : : : I I : I I I I I I 

Db 452 IRHNGAHHLRAAVHQISFCQYFLLDIAFVLLLGAAL 487 



RESULT 2 
A48801 

2-hydroxyacylsphingosine 1-beta-galactosyltransf erase (EC 2.4.1.45) precursor - 
rat 

N;Alternate names: UDPgalactose-ceramide galactosyltransf erase 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 07-Apr-1994 #sequence_revision 18-Nov-1994 #text_change 05-Nov-1999 
C;Accession: A48801; 156576; S63480 
R;Schulte, S.; Stoffel, W. 

Proc. Natl. Acad. Sci. U.S.A. 90, 10265-10269, 1993 



A; Title: Ceramide UDPgalactosyltrans f erase from myelinating rat brain: 
purification, cloning, and expression. 

A; Reference number: A48801; MUID: 94052143; PMID:7694285 

A;Accession: A48801 

A; Status: preliminary 

A; Molecule type: mRNA; protein 

A; Residues: 1-541 <SCH> 

A; Cross-references: GB:L21698; NID:g437665; PIDN : AAA16108 . 1 ; PID:g437666 
A; Experimental source: brain 

A;Note: sequence extracted from NCBI backbone (NCBIN : 139520 , NCBIP : 139522 ) 
R;Stahl, N. ; Jurevics, H.; Morell, P.; Suzuki, K. ; Popko, B . 
J. Neurosci. Res. 38, 234-242, 1994 

A;Title: Isolation, characterization, and expression of cDNA clones that encode 

rat UDP-galactose : ceramide galactosyltransf erase . 

A; Reference number: 156576; MUID : 94358923 ; PMID: 7521399 

A;Accession: 156576 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-541 <RES> 

A; Cross-references: EMBL:U07683; NID:g464025; PIDN: AAA50212 . 1 ; PID:g464026 

R;Schulte, S.; Stoffel, W. 

Eur. J. Biochem. 233, 947-953, 1995 

A; Title : UDP galactose : ceramide galactosyltransf erase and glutamate/aspartate 
transporter: copurif ication, separation and characterization of the two 
glycoproteins . 

A; Reference number: S63480; MUID : 96085162 ; PMID: 8521863 
A;Accession: S63480 
A;Molecule type: protein 

A; Residues: 21-28, 1 A 1 , 30-31, ' Q ' , 33-39 ; 73-77, 'X ' , 79-87 ; 155-166, ' Q ',168-173 ; 315- 
322; 330-331, 'EX 1 ,334-338, 1 Q 1 , 340-353; 416-423; 510-515 <SUL> 
A; Experimental source: brain 
C; Function: 

A; Description: transfers galactose from UDP-galactose to ceramide 
C; Superf amily : glucuronosyltransf erase 

C;Keywords : glycoprotein; glycosyltransf erase; hexosyltransf erase 
F;l-20/Domain: signal sequence #status predicted <SIG> 

F;21-541/Product : 2-hydroxyacylsphingosine 1-beta-galactosyltransf erase #status 
experimental <MAT> 

F;78, 333/Binding site: carbohydrate (Asn) (covalent) #status experimental 

Query Match 24.7%; Score 684; DB 2; Length 541; 

Best Local Similarity 32.2%; Pred. No. 1.2e-45; 

Matches 165; Conservative 94; Mismatches 184; Indels 70; Gaps 12; 

Qy 13 LLPGVLLS EAAKI LTI STV-GGSHYLLMDRVSQI LQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I | : : | | | | : : : . | | : : : I : I I : I : I : I 
Db 11 LWSAVGIARAAKIIIVPPIMFESHLYIFKTLASALHERGHHTVFLLSEGRD 61 

Qy 71 EKSYQVISWLAPEDHQ— REFKKSF -DFFLEETL GGRGKFENLLNVLEYLAL 119 



Db 



62 



| . | . . | III* • II 

IDPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAVELVDILDHYTK 112 



Qy 



Db 



120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : : I I I I I : : : I : I I I : I I I I • : = III 

113 NCDMMVGNQALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 



Qy 



180 PLSYVPVFRSLLTDHMDFWGRVKN- 



FLMFFSFCRRQQH MQSTFD 222 



Db 


173 


PLAYVPEFNSLLTDRMNFLERMKNTGVYLISRMGVSFLVLPKYERIMQKYNLLPAKSMYD 


232 


Qy 


223 


NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 

1 : : 1 1 : : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 : : 1 1 1 : 1 * 1 

LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 


282 


Db 


233 


275 


Qy 


283 


LENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 

| : : : : | | | I I : 1 : 1 : : i : : 1 Mill!: : 1 : 1 
LQRWVDGAQEHGFVLVSFGAGVKYL-SEDIANKLAG7VLGRLPQKVIWRFSGT KPKNL 


342 


Db 


276 


331 


Qy 


343 


AANVKIVDWLPQSDL]J^PSIRLFVTHGGQNSIMEAIQHGVP^4VGIPLFGDQPENMVRVE 

1 |:::||||:||| 1 : 1 1 1 : : 1 1 1 1 1 1 1 : 1 1 1 1 : 1 1 : 1 lh 

GNNTKLIEWLPQNDLLGHSNIRAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQ 


402 


Db 


332 


391 


Qy 




a^rk r Ti i r"\/"c;TOT WK1 tfAFTT AT MKDTMFDKRYKS AAVAASVTLRSHPLSPTORLVGWIDHV 

|| | : : : : I : : : : : 1 : 1 1 1 : 1 1 1 III:: 
AKGMGILLEWNTVTEGELYDALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTTYWIDYI 


462 


Db 


392 


451 


Qy 


463 


LQTGGATHLKP YVFQQ PWHEQ YLFDV- FVFLLG 494 
I : II II: II : : : 1 1 : 1 1 1 1 1 
LRHDGAHHLRSAVHQISFCQYFLLDIAFVLLLG 484 




Db 


452 





RESULT 3 
A39092 

glucuronosyltransf erase (EC 2.4.1.17) 1 precursor, bilirubin-specif ic - human 
N;Alternate names: bilirubin UDP-glucuronosyltransf erase 
C; Species: Homo sapiens (man) 

C;Date: 17-Jul-1992 #sequence_revision 17-Jul-1992 #text_change 29-Sep-1999 
C;Accession: A39092; E42586; A48887 
R;Ritter, J.K.; Crawford, J.M. ; Owens, I.S. 
J. Biol. Chem. 266, 1043-1047, 1991 

A;Title: Cloning of two human liver bilirubin UDP-glucuronosyltransf erase cDNAs 
with expression in COS-1 cells. 

A; Reference number: A39092; MUID: 91093210; PMID: 1898728 
A;Accession: A39092 
A; Molecule type: mRNA 
A; Residues: 1-533 <RIT> 

A; Cross-references: GB:M57899; NID:gl84472; PIDN : AAA63195 . 1 ; PID:gl84473 
R;Ritter, J.K.; Chen, F. ; Sheen, Y.Y.; Tran, H.M. ; Kimura, S.; Yeatman, M.T.; 
Owens , I.S. 

J. Biol. Chem. 267, 3257-3261, 1992 

A; Title: A novel complex locus UGT1 encodes human bilirubin, phenol, and other 
UDP-glucuronosyltransf erase isozymes with identical carboxyl termini. 
A; Reference number: A42586; MUID: 92147680; PMID: 1339448 
A; Accession: E42586 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: nucleic acid 
A; Residues: 1-288 <RI2> 

A; Cross-references: GB:M84125; NID:g340131; PIDN : AAA61248 . 1 ; PID: g340132 
A;Note: sequence extracted from NCBI backbone (NCBIP : 81433 ) 
R;Ritter, J.K.; Yeatman, M.T.; Kaiser, C; Gridelli, B. ; Owens, I.S. 
J. Biol. Chem. 268, 23573-23579, 1993 

A; Title: A phenylalanine codon deletion at the UGT1 gene complex locus of a 
Crigler-Naj jar type I patient generates a pH-sensitive bilirubin UDP- 
glucuronosyltransf erase . 



A; Reference number: A48887; MUID : 94043159 ; PMID: 8226884 
A/Accession: A48887 

A; Status : preliminary; not compared with conceptual translation 

A; Molecule type: DNA 

A; Residues: 161-170,172-180 <RI3> 

A; Experimental source: liver, Crigler-Na j j ar type I patient 
A;Note: sequence extracted from NCBI backbone (NCBIP : 138934 ) 
C; Genetics : 

A; Gene: GDB : UGT1A1 ; UGT1 

A; Cross-references: GDB: 120007; OMIM: 191740 

A; Map position: 2q37-2q37 

C; Superf amily: glucuronosyltransf erase 

C;Keywords: glycosyltrans f erase; hexosyltransf erase; transmembrane protein 

Query Match 24.5%; Score 67 9; DB 2; Length 533; 

Best Local Similarity 33.8%; Pred. No. 2.8e-45; 

Matches 175; Conservative 93; Mismatches 211; Indels 38; Gaps 13; 



Qy 8 LLVGFLL — PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

I : : I II I :: I I I I I I I I I I : I I III I I : : I I 

Db 11 LVLGLLLCVLGPWSHAGKILLI-PVDGSHWLSMLGAIQQLQQRGHEIWL AP 62 

Qy 66 D — - FKKEEKS YQVT SWLAPEDHQRE-FKKS F DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : = = 

Db 63 DASLYI RDGAFYTLKT Y — PVP FQREDVKES FVS LGHNVFENDS FLQRVI KTYKKIK 117 

Qy 112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I I I I : I : : I I I : I I : : : : I I : : I : I II I 
Db 118 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 



Qy 172 SLEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

I I I I I I I I I I I : I I I I I I I I I : I I : I : I : 

Db 178 SLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDWYSPY-ATLASE 236 



Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 28 8 

I : : II I : I III I : I I : : I I I : I I I : I : I : I : I 

Db 237 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 295 

Qy 289 KFGDSGFVLWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDWLAANVKI 348 

I : I I : : I I I I I : : : : I : I I I : I : : : MM : 

Db 296 ASGEHGI VVFSLGSMVSEIPEKKAM-AIADALGKIPQTVLWRYTGT RPSNLANNTIL 351 



Qy 349 VDWLPQSDLIAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE7VKKFGV 408 

11111:11111 I I : I I I : : I : I : I I I I I : I I I I I I : I hi I II 

Db 352 VTWLPQNDLLGHPMTRAFITHAGSHGWESICNGVPMVMMPLFGDQMDNAKRMETKGAGV 411 

Qy 4 09 SIQLKKLKAETIJVLKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGA 468 

:::::: I I : I : : I I I I : I : : I : I I I : : I : : I I 

Db 412 TLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGA 471 



Qy 4 69 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I | : : : | | | | | : I : : I 

Db 472 PHLRPAAHDLTWYQYHS LDVI GFLLAWLTVAFIT FK 508 



RESULT 
A40467 



4 



glucuronosyltransf erase (EC 2.4.1.17) precursor - rat 
N;Alternate names: UDP-glucuronosyltransf erase isoform 53K 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 07-Feb-1992 #sequence_revision 07-Feb-1992 #text_change 17-Nov-2000 
C;Accession: A40467; A23520; S59627; A26064; 155247 
R;Haque, S.J.; Petersen, D.D.; Nebert, D.W.; Mackenzie, P.I. 
DNA Cell Biol. 10, 515-524, 1991 

A;Title: Isolation, sequence, and developmental expression of rat UGT2B2: the 
gene encoding a constitutive UDP glucuronosyltransf erase that metabolizes 
etiocholanolone and androsterone . 

A; Reference number: A40467; MUID: 91369480; PMID: 1909872 

A; Accession: A404 67 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-530 <HAQ> 

R; Jackson, M.R.; Burchell, B. 

Nucleic Acids Res. 14, 779-795, 1986 

A; Title: The full length coding sequence of rat liver androsterone UDP- 
glucuronyltransf erase cDNA and comparison with other members of this gene 
family. 

A; Reference number: A23520; MUID: 86120371 ; PMID: 3003696 
A;Accession: A23520 
A; Molecule type: mRNA 

A; Residues: 31-158, 'E', 160-285, *S', 287-350, 'I', 352-362, ' I ', 364-430, 'E f ,432-530 
<JAC> 

A;Cross-references: GB:X03478; NID:g57452; PIDN : CAA27198 . 1 ; PID:g57453 
A;Note: the authors translated the codon ATT for residue 321 as Asn 
R;Yamashita, A.; Watanabe, M. ; Tonegawa, T . ; Sugiura, T . ; Waku, K. 
Biochem. J. 312, 301-308, 1995 

A;Title: Acyl-CoA binding and acylation of UDP-glucuronosyltransf erase isoforms 

of rat liver: their effect on enzyme activity. 

A; Reference number: S59626; MUID : 96077159; PMID: 7492328 

A;Accession: S59627 

A;Molecule type: protein 

A; Residues: 24-44 <YAM> 

R;Mackenzie, P.I. 

J. Biol. Chem. 261, 14112-14117, 1986 

A;Title: Rat liver UDP-glucuronosyltransf erase . cDNA sequence and expression of 

a form glucuronidating 3-hydroxyandrogens . 

A;Reference number: A26064; MUID : 87033594 ; PMID:2429951 

A;Accession: A26064 

A; Molecule type: mRNA 

A;Residues: 1-430, ' E ', 432-530 <MAC> 

A;Cross-references: GB:J02589; NID:g207582; PIDN :AAA42 314 . 1 ; PID:g207583 

A; Experimental source: hepatic 

C; Super f amily : glucuronosyltransf erase 

C;Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 
F; 1-23/Domain: signal sequence ffstatus predicted <SIG> 
F;24-530/Product : glucuronosyltransf erase #status experimental <MAT> 

Query Match 24.4%; Score 674.5; DB 2; Length 530; 

Best Local Similarity 33.5%; Pred. No. 6.3e-45; 

Matches 170; Conservative 86; Mismatches 219; Indels 33; Gaps 13; 

34 SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISW-LAPEDHQREFKKS 92 

II::: : I I I I I : I I : I : : : • I : : : : I . I I 

34 SHWMNIKIILDELVQRGHEVTVLKPSAYFFLDPKKSSDLKFEIFSTSISKDELQNHFIKL 93 



Qy 

Db 



Qy 93 FDFFLEE TLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDSLKNENFDMVIVET 146 

I : I I : I I : III : : I : I I : I I : : : 

Db 94 LDVWTYELPRDTCLSYSPILQNLVYEFSYFYLSICKDAVSNKQLMTKLQESKFDVLFADP 153 

Qy 147 FDYCPFLIAEKLGKPFVAILSTSFG-SLEFGLP IPLSYVPVFRSLLTDHMDFWGRVK 202 

I I I I I I I I : I I I I I I : : I I I I I I I I Mill 
Db 154 VASCGDLIAELLHIPFLYSLSFSPGHKLEKSIGKFILPPSYVPVILSGLAGKMTFIDRVK 213 

Qy 203 NF— LMFFSF-CRRQQHMQ-STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

I : : : I I I : I : I I : I I : : I I : I I I : I 

Db 214 NMICMLYFDFWFERLRHKEWDTFYSEIL GRPTTVDETMSKVEIWLIRSYWDLKFP 268 

Qy 259 RPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 318 

I I I I 1:111 I | I I : I : I : I I : I : I I : : I I I I I : : : I I 

Db 269 HPTLPNVDYIGGLHCKPAKPLPKDMEEFVQSSGEHGVWFSLGSMVS NMTEEKAN 323 

Qy 319 AFAHLPQGVIWKCQCSHWPKDVHI^W^VKIVDWLPQSDLIAHPSIRLFVTHGGQNS 374 

I I : I I I : I I I I : : I I I I : I I I I I : I I I I I I I 

Db 324 AIAWALAQI PQKVLWKFD GKTPATLGPNTRVYKWLPQNDLLGHPKTKAFVTHGGANG 380 

Qy 375 IMEAIQHGVP1WGIPLFGDQPENMVRVEAKKFG 434 

: i I I I I : I I : I I I I I I I I I : I : : I I II:::: :::::: | | 

Db 381 L YEAI YHGI PMI G I P L FGDQ P DN I AHMVAKGAAVS LN I RTMS KLD FL S ALVEVI DN P F YK 440 

Qy 435 SAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLG 494 

: II I : I MM:::: I I M : I I I : : : II III 

Db 441 KNVMLLSTIHHDQPMKPLDRAVFWIEFIMRHKG7VKHLRPLGHNLPWYQYHSLDVIGFLLT 500 

Qy 495 LTLGTLWLCGK-LLGMAVWWLRGARKVK 521 

I II I I :::: :|:| 
Db 501 CFAVIAALTVKCLLFMYRFFVKKEKKMK 528 



RESULT 5 
A42233 

glucuronosyl trans f erase (EC 2.4.1.17) 2 - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 19-Jun-1992 #sequence_revision 18-Sep-1992 #text_change 29-Sep-1999 

C;Accession: A42233; A24324 

R;Mackenzie, P.I.; Rodbourn, L. 

J. Biol. Chem. 265, 11328-11332, 1990 

A; Title: Organization of the rat UDP-glucuronosyltransf erase, UDPGTr-2, gene and 
characterization of its promoter. 

A;Reference number: A42233; MUID : 90293083 ; PMID:2113533 

A;Accession: A42233 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-52 9 <MAC> 

A; Cross-references : GB: J054 82 

A;Note: the authors translated the codon GTA for residue 57 as B, and GTC for 
residue 116 as B 
R;Mackenzie, P.I. 

J. Biol. Chem. 261, 6119-6125, 1986 

A; Title: Rat liver UDP-glucuronosyltransf erase . Sequence and expression of a 

cDNA encoding a phenobarbital-inducible form. 

A; Reference number: A24324; MUID: 86196018 ; PMID:3084479 



A/Accession: A24324 

A;Molecule type: mRNA 

A;Residues: 1-407, 'V , 409-529 <MA2> 

A;Cross-references: GB:M13506; NID:g207580; PIDN: AAA42313 . 1; PID:g207581 

A; Experimental source: liver 

C; Super family : glucuronosyl transferase 

C;Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 24.2%; Score 671; DB 2; Length 529; 

Best Local Similarity 32.2%; Pred. No. 1.2e-44; 

Matches 175; Conservative 90; Mismatches 228; Indels 50; Gaps 15; 

Qy 6 VLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

: : | : : I I I : I I I I : : : : I I I I I : I : 

Db 11 LIQLICYFRPGA CGKVLVWPT- E YSHWIN I KI I LNELAQRGHEVTVLVS SAS I LI E 65 

Qy 66 DFKKEEKSYQVISW-LAPEDHQREFKKSFDFFLE — ETLG GRGKFENLLN VLE 115 

I : ::: : | I : I : I I I : III I : : I I : I 

Db 66 PTKESSINFEIYSVPLSKSDLEYSFAKWIDEWTRDFETLSIWTYYSKMQKVFNEYSDWE 125 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I : I : I I : I I : : : : I I : I I I I I I I 

Db 12 6' NL CKALIWNKSLMKKLQGSQFDVT LADAVGPCGELLAELLKTPLVYSLRFCPGYRCE 182 

Qy 172 SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL — MFFSFCRRQQHMQSTFDNTIKEHF 229 

I I I : I I I I I I I I : I I I I I I I I : : I I I : : : : 

Db 183 KFSGGLPLPPSYVPWLSELSDRMTFVERVKNMLQMLYFDF WFQPFKEKSWSQFY 237 

Qy 230 TE — GSRPVLSHLLLK7VELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI 287 

: : I | : : : | | : : | | : : : | III : I I I I II I I : I : : : I I : 

Db 238 SDVLGRPTTLTEMMGKADIWLIRTFWDLEFPHPFLPNFDFVGGLHCKPAKPLPREMEEFV 297 

Qy 288 AKFGDSGFVLVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDWLAANVK 347 

I : I I : : I I I I I I : : I I : I I I : I : I I : I : 

Db 298 QSSGEHGVWFSLGSMVKNL-TEEKANWASALAQIPQKVVWRFD GKKPDTLGSNTR 353 

Qy 348 IVT)WLPQSDLI^PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFG 407 

: I : I I : I I I I I : I I I I I I I I I I I I : I : I I I I I I I I I : I : : : I I 
Db 354 LYKWIPQNDLLGHPKTKAFVAHGGTNGIYEAIYHGIPIVGIPLFADQPDNINHMBAKGAA 413 

Qy 408 VS IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGG 467 

I : I I : I : I I I I I : I I I : I I I I I : : I : : I 

Db 414 VRVDFSILSTTGLLTALKIVMNDPSYKENAMRLSRIHHDQPVKPLDRAVFWIEYVMRHKG 473 

Qy 468 ATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGT LWLCGKLLGMAVWWLRGARK 519 

III:: I :: I I I I I : I I : I I I : I 

Db 474 AKHLRSTLHDLSWFQYHSLDVIGFLLLCWGWFIITKFCLFCCRKTANM GKK 526 

Qy 52 0 VKE 522 

I I 

Db 527 KKE 529 



RESULT 6 
B47113 

glucuronosyltransferase (EC 2.4.1.17) UGT2B13 precursor - rabbit 
C; Species: Oryctolagus cuniculus (domestic rabbit) 



C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 29-Sep-1999 
C;Accession: B47113 

R;Tukey, R.H.; Pendurthi, U.R.; Nguyen, N.T.; Green, M.D.; Tephly, T.R. 
J. Biol. Chem. 268, 15260-15266, 1993 

A; Title: Cloning and characterization of rabbit liver UDP- 

glucuronosyltransf erase cDNAs . Developmental and inducible expression of 4- 
hydroxybiphenyl UGT2B13. 

A; Reference number: A47113; MUID : 93315511 ; PMID: 8325897 
A; Accession: B47113 
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-531 <TUK> 

A; Cross-references: GB:L01081; NID:gl65796; PIDN:AAA18020 . 1; PID:gl65797 
C; Superf amily: glucuronosyltrans f erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 24.2%; Score 670; DB 2; Length 531; 

Best Local Similarity 32.6%; Pred. No. 1.4e-44; 

Matches 169; Conservative 90; Mismatches 206; Indels 54; Gaps 14; 

Qy 34 SHYLLMDRVSQI LQDHGHN VTML NHKRG PFMPDFKKEEKSYQVISW 79 

I I : : I : I I I I I : I I : : I I : : I : I : I 

Db 35 SHWMNMKT I LDALVQQGHEVTVLRS S AS I YI GSNNES GI KFET FHT S YRKDEI EN FFMDW 94 

Qy 80 LAPEDHQREFKKSFDFFLE ETLGGRGKFENLLNVLEYLAL QCSHFLNRKDIMDS 133 

: I :: : I II : : I : I : I : I : I 

Db 95 F YKMI YNVSIESYWETFS LTKMVILKYSDICEDICKEVI LNKKLMTK 141 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKP FVAILSTSFGSLEFGLPIPLSYVPV 186 

I : I I : I : : I : I I I I II : I I I : I I I I I I 
Db 142 LQESRFDWLADPVSPGGELLAELLKIPLVYSLRGFVGYMLQKHGG GLLLPPSYVPV 198 

Qy 187. FRSLLTDHMDFWGRVKNFL— MFFSFCRRQQHMQSTFDNTIKEHFTEGSRPV-LSHLLLK 243 

II I I I I : I I : : I I : : : I I III hi 

Db 199 MMSGLGSQMT FMERVQNLLCVLYFDFW-FPKFNEKRWDQFYSEVL GRPVTFLELMGK 254 

Qy 244 AELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSM 303 

I : : I I I : : I I I I I I I : : I I I I I I I : I I : : I : I : I : I I : : I I I I 
Db 255 ADMWLIRSYWDLEFPRPLLPNFDFIGGLHCKPAKPLPQEMEDFVQSSGEEGVWFSLGSM 314 

Qy 304 WTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHI^^ 363 

: : I : : I I I I I I : I : : I I : I : : I : I I : I I I I I 

Db 315 ISNL-TEERANVIASALAQLPQKVLWRFE GKKPDMLGSNTRLYKWI PQNDLLGHPKT 370 

Qy 364 RLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALK 423 

: I : I I I I I : I I I I I : I I II : I I I I I I : I : I : : I I I : I I : : I 
Db 371 KAFITHGGANGVFEAIYHGIPMVGLPLFGDQLDNIVYMKAKGAAVKLNLKTMSSADLLNA 430 

Qy 424 MKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQ 483 

: I : : I II I : II I : I I I I I : : I : : I I I I : I : : 

Db 431 LKTVINDPSYKENAMTLSRIHHDQPMKPLDRAVFWIEYVMRHKGAKHLRV7\7^HDLTWYQY 4 90 

Qy 484 YLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

: I I II I I : I I : : : I I I I : 

Db 4 91 HSLDVIGFLLACVAITTYLIVKCCLLVYRYVLGAGKKKK 529 



RESULT 7 
C47113 

glucuronosyltransferase (EC 2.4.1.17) UGT2B14 precursor - rabbit 
C; Species: Oryctolagus cuniculus (domestic rabbit) 

C;Date: 03-May-1994 #sequence_revision 03-May-1994 #text_change 29-Sep-1999 
C; Access ion: C47113 

R;Tukey, R.H.; Pendurthi, U.R.; Nguyen, N.T.; Green, M.D.; Tephly, T.R- 
J. Biol. Chem. 268, 15260-15266, 1993 

A; Title: Cloning and characterization of rabbit liver UDP- 

glucuronosyltransf erase cDNAs . Developmental and inducible expression of 4- 
hydroxybiphenyl UGT2B13. 

A;Reference number: A47113; MUID: 93315511 ; PMID: 8325897 
A;Accession: C47113 
A; Status : preliminary 
A;Molecule type: mRNA 
A; Residues: 1-530 <TUK> 

A;Cross-references: GB:L01082; NID:gl65798; PIDN: AAA18021 . 1 ; PID:gl65799 
C; Super family: glucuronosyltransferase 

C;Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 23.9%; Score 662.5; DB 2; Length 530; 

Best Local Similarity 34.5%; Pred. No. 5.5e-44; 

Matches 161; Conservative 75; ..Mismatches 209; Indels 21; Gaps 10; 

Qy 50 GHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQRE-FKKSFDFFLEETLGGRGKFE 108 

I I I : I : I : I : : : I : I : I : : ' : 

Db 51 GHEVIVLRNSASIFIDPSKQT^NIKFETFPIAATKDDLEDLFVHYVSTWTNARQNSQWKYF 110 

Qy 109 NLLNVL— EY LALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFV 163 

: I I | || I : I : I I : II:::: I hill Ml 

Db 111 SLLQKLFSEYSDSCENACKEWFNKTLMTKLQESRFDILLSDAIGPCGELLAELLKIPFV 170 

Qy 164 AILSTSFG SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL — MFFS FCRRQQHM 217 

| : | I i : I I I I h I h II I I I I : : I I I 

Db 171 YSLRFTPGYTMEKYSGGLSVPPSYVPIILSDLSGKMTFMERVNNMLCMLYFDFW- FQMFN 22 9 

Qy 218 QSTFDNTIKEHFTEGSRPV-LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 276 

: : | | III | | : | | : : | I I : : I I I I I I : I I I I II 

Db 230 KKRWDQFYSEVL GRPVTFSELVGKADMWLIRSYWDLEFPRPTLPNIQFVGGLHCKPA 286 

Qy 277 KPVPQDLENFIAKFGDSGFVTiVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHW 336 

| | : | : : : I I : I : I I : : I I I I I : I : : I M I I I I I h 
Db 287 KPLPKEMEEFVQSSGEEGWVFSLGSMVSN-MTEERANLIASAFAQLPQKVIWRFD G 342 

Qy 337 PKDVli]J^ANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQH 396 

| | I : I I I : I I : I I I I I : I I I I I I I I I I I I I : I I I h I I I h I h 

Db 343 QKPETLGPNTRIYDWIPQNDLLGHPKTKAFVTHGGANGIYEAIHHGIPMVGLPLFGEQPD 402 

Qy 397 NMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLV 456 

I : : II : : I : : I I : I : : I I I : I I hill 
Db 403 N I AHMT AKGAAI RLNWKTMS S E DL LNAL KT VI ND P S YKEN VMT L S S I HH DQ PMK P L DRAV 462 

Qy 457 GWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL 502 

I I : : I : : I I I I : I : : II I h : : I 

Db . 463 FWIEYVMRHKGAKHLRVAAHDLTWFQYHSLDWGFLVSCAAFLIFL 508 



RESULT 8 
A35343 

glucuronosyltransf erase (EC 2.4.1.17) - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 14-Sep-1990 #sequence_revision 14-Sep-1990 #text_change 29-Sep-1999 
C; Accession: A3 534 3 

R;Sato, H.; Koiwai, 0.; Tanabe, K. ; Kashiwamata, S. 

Biochem. Biophys . Res. Commun. 169, 260-264, 1990 

A;Title: Isolation and sequencing of rat liver bilirubin UDP- 

glucuronosyltransferase cDNA: possible alternate splicing of a common primary 
transcript . 

A;Reference number: A35343; MUID: 90274676; PMID:2112380 
A;Accession: A35343 
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-531 <SAT> 

A;Cross-references: GB:M34007; NID:g207578; PIDN : AAA42312 . 1 ; PID:g207579 
C; Superfamily : glucuronosyltransf erase 

C;Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 

Query Match 23.8%; Score 658; DB 2; Length 531; 

Best Local Similarity 30.9%; Pred. No. 1.2e-43; 

Matches 171; Conservative 106; Mismatches 211; Indels 66; Gaps 17; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: | |||: || : I I : I : : I I I : I I I : I II : I 
Db 7 LRGLSGLLLLLCALP WAEGGKVX-VFPMEGSHWLSMRDWRELHARGHQAVVL 58 

Qy 61 GPFMPDFKKEEKSYQVISWLAP EDHQREF KKSFD FFLEETLGGRGK 106 

| : | | : : : : | I : : I I I III: II lh I 

Db 59 APEVTVHMKGEDFFTLQTYAFPYTKEEYQREILGNAKKGFEPQHFVKTFF — ETMASIKK 116 

Qy 107 FENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAIL 166 

|:| ||: I : I : : I : : I I : I : : I I : I : I II I 

Db 117 FFDL YAN S CAALLHNKT L I QQLN S S S FDWLT D P VF P CGALLAKYLQ I PAVFFL 17 0 

Q y 167 STSFGSLEF GLPI PLSYVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQST 22 0 

: : : : I I I I : I : : I : I I I I II I I I : I I : I 
Db 171 RSVPCGIDYEATQCPKPSSYIPNLLTMLSDHMTFLQRVKNMLYPLTLKYIC HLSIT 226 

Qy 221 FDNTIKEHFTEGSR PVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIK 277 

: : : I I I I I : I I I I I : I I : : I I I : : I I : I 
Db 227 PYESLASELLQREMSLVEVLSH ASVWLFRGDFVFDYPRPIMPNMVFIGGINCVIKK 282 

Qy 27 8 PVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWP 337 

I : I : I : : I : I I : : I I I I I : : I : I : I I : : I : : 

Db 283 PLSQEFEAYVNASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTLLWRYTGT— 338 

Qy 338 KDWLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPEN 397 

: : | | I : I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I i I I : I 

Db 339 RPSNLAKNTILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMVMMPLFGDQMDN 398 

Qy 398 MVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVG 457 

| : | : I I : : : : : I : I : I : : : I I I : I : : I : I I 
Db 399 AKRMETRGAGVTLNVLEMTADDLENALKTVINNKS YKENIMRLS S LHKDRPI EPLDLAVF 458 



QY 



458 WIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGM 509 



|:::|:: || ||:| I:: : II III : I II I 

Db 459 WVEYVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVYKSCAYGCRKCFG- 517 

Qy 510 AVWWLRGARKVKET 523 

I : I I : : 

Db 518 GKGRVKKS 525 



RESULT 9 
157961 

glucuronosyltransf erase (EC 2.4.1.17) precursor - rat 
N;Alternate names: glucuronosyltransf erase 1 Bl; morphine UGT 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 12-Aug-1996 #sequence_revision 12-Aug-1996 #text_change 29-Sep-1999 
C;Accession: 157961; S51197; S68333 

R;Coffman, B.L.; Green, M.D.; King, CD.; Tephly, T.R. 
Mol. Pharmacol. 47, 1101-1105, 1995 

A; Title: Cloning and stable expression of a cDNA encoding a rat liver UDP- 

glucuronosyltransferase (UDP-glucuronosyltransf erase 1.1) that catalyzes the 

glucuronidation of opioids and bilirubin. 

A;Reference number: 157961; MUID : 95327065 ; PMID:7603447 

A;Accession: 157961 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-535 <RES> 

A; Cross-references: EMBL:U20551; NID:g695161; PIDN : AAC52219 . 1 ; PID:g695162 
R;Ishii, Y. ; Tsuruda, K. ; Tanaka, M. ; Oguri, K. 
Arch. Biochem. Biophys. 315, 345-351, 1994 

A; Title: Purification of a phenobarbital-inducible morphine UDP- 

glucuronyltransferase isoform, absent from gunn rat liver. 

A;Reference number: S51197; MUID: 95077409; PMID:7986077 

A;Accession: S51197 

A;Molecule type: protein 

A;Residues: 30-41 <ISH> 

R;Ikushiro, S.; Emi, Y. ; Iyanagi, T. 

Arch. Biochem. Biophys. 324, 267-272, 1995 

A; Title: Identification and analysis of drug-responsive expression of UDP- 
glucuronosyltransf erase family 1 (UGTl) isozyme in rat hepatic microsomes using 
anti-peptide antibodies. 

A;Reference number: S68333; MUID: 96132654 ; PMID:8554318 

A;Accession: S68333 

A;Molecule type: protein 

A; Residues: 30-37 <IKU> 

C; Genetics : 

A; Gene: UGT1.1 

C; Superf amily : glucuronosyltransf erase 

C; Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.8%; Score 658; DB 2; Length 535; 

Best Local Similarity 31.4%; Pred. No. 1.3e-43; 

Matches 171; Conservative 97; Mismatches 215; Indels 62; Gaps 14; 

Qy 13 LLPGVLL SEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

|||:|| 111:11 : I I I : I I I I I I III:: I 

Db 13 LLPCLLLCVLGPSASHAGKLLVI-PIDGSHWLSMLGVIQQLQQKGHEVWI APEAS 67 

Qy 66 DFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFEN LLNVLE 115 



II I : : I :: |:| III: III:: 

Db 68 I H I KEGS FYTMRKYPVP FQNENVT AA FVEL GRSVFDQDPFLLRWKTYNKVKR 120 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGS 172 

I Ml I : : I I I : : I I : : : I I : : I : I I I I : I 

Db 121 DSSMLLSGCSHLLHNAEFMASLEQSHFDALLTDPFLPCGSIVAQYLSLPAVYFLNALPCS 180 

Qy 173 LEF GLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQ S T FDNT I K 226 

I : I I I I II I I I I I : I I I I I : : : II : I 

Db 181 LDLEATQCPAPLSYVPKSLSSNTDRMNFLQRVKNMIIALTENFLCRWYSPYGSLATEIL 240 

Qy 227 EHFTEGSRPVLSHLLLK7VELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENF 286 

: : || I : I : : I I I : I I : : I I I : : I I : I : |: I : 

Db 241 Q KEVTVKDLLSPASIWLMRNDFVKDYPRPIMPNMVFIGGINCLQKKALSQEFEAY 295 

Qy 287 IAKFGDSGFVLWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANV 34 6 

: I : I I : : I I I I I : : I : I : I I I : I : : : : I I I 
Db 296 VNASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTVLWRYTGT RPSNLAKNT 351 

Qy 347 KIVDWLPQSDLLAHPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKF 406 

: I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I I I I : I I : I : 

Db 352 ILVKWLPQNDLLGHPKARAFITHSGSHG1YEGICNGVPMVMMPLFGDQMDNAKRMETRGA 411 

Qy 4 07 GVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTG 4 66 

| | : : : : : I : I : I : : : I I I : I : : I : I | | : : : I : : 
Db 412 G VT LN VL EMT AD D L EN AL KTVINNKSYKENIMRLSSLHKDRPIEPLD LAVFWVE YVMRH K 471 

Qy 4 67 GATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL C GKL L GMAVWW L RGAR 518 

I | | I : I I : : : II I I I : I : : : II I I 
Db 472 GAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVYKSCAYGCRKCFG GKG 524 

Qy 519 KVKET 523 

: I I : : 

Db 525 RVKKS 52 9 



RESULT 10 
JN0619 

glucuronosyltransf erase (EC 2.4.1.17) 2B-4 precursor - human 
N; Alternate names: UDP-glucuronosyltransf erase 2B-11 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 02-Jun-2000 
C;Accession: JN0619; A27878 

R;Jin, C.J.; Miners, J.O.; Lillywhite, K.J.; Mackenzie, P.I. 
Biochem. Biophys . Res. Cornmun. 194, 496-503, 1993 

A; Title: cDNA cloning and expression of two new members of the human liver UDP- 
glucuronosyltransf erase 2B subfamily. 

A; Reference number: JN0619; MUID : 93326164 ; PMID: 8333863 
A;Accession: JN0619 
A;Molecule type: mRNA 
A; Residues: 1-528 <JIN> 

A;Cross-references: GB:AF081793; NID:g3426331 
A; Experimental source: liver 

R; Jackson, M.R.; McCarthy, L.R.; Harding, D . ; Wilson, S.; Coughtrie, M.W.H.; 
Burchell, B. 

Biochem. J. 242, 581-588, 1987 

A;Title: Cloning of a human liver microsomal UDP-glucuronosyltransf erase cDNA. 



A; Reference number: A27878; MUID : 8724 1362 ; PMID: 3109396 
A;Accession: A27878 
A; Molecule type: mRNA 

A; Residues : 1-108 , 'F\ 110-170, ' RP ' , 173-381 , 'K', 383-384, ' SPR' , 388-395, 'F', 397-528 
<JAC> 

A;Cross-references: GB:Y00317; NID:g37588; PIDN : CAA684 15 . 1 ; PID:g37589 
C; Genetics: 

A; Gene: GDB : UGT2B4 ; UGT2B11 

A;Cross-references: GDB: 5891331; OMIM: 600067 

A; Map position: 4ql3-4ql3 

C; Superf amily: glucuronosyltransf erase 

C; Keywords : glycoprotein; glycosyltransf erase ; hexosyltransf erase; transmembrane 
protein 

F; 1-2 3 /Domain : signal sequence ((status predicted <SIG> 

F;24-528/Product : glucuronosyltransf erase 2B-11 ((status predicted <MET> 
F; 4 92- 50 9 /Domain : transmembrane ((status predicted <TMM> 
F;315/Binding site: carbohydrate (Asn) (covalent) ((status predicted 

Query Match 23.7%; Score 656.5; DB 2; Length 528; 

Best Local Similarity 33.9%; Pred. No. 1.6e-43; 

Matches 173; Conservative 85; Mismatches 213; Indels 39; Gaps 15; 



Qy 34 SHYLLMDRVSQI LQDHGHNVTMLNHKRG -PFMPDFKKEEKSYQVISWLAPEDHQREF 89 

II:::: I I I I I : I II II : : II- 

Db 34 SHWMNI KTI LDELVQRGHEVTVLAS SAS I S FDPNS PSTLKFEVYPVSLTKTEFEDI I KQL 93 

Qy 90 KKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDM 141 

I : I : I : I : : I I : : I : I I : I I : 

Db 94 VKRWAELPKDTFWSYLSQVQEIMWTFNDILRKF CKDIVSNKKLMKKLQESRFDV 147 

Qy 142 VIVETFDYCPF— LIAEKLGKPFVAILSTSFG-SLE FGLPI PLS YVPVFRSLLTDHM 195 

I : : II hill III I I I : : I II I I I I I I I hi I 

Db 148 VLADA — VFPFGELLAELLKIPFWSLRFSPGYAIEKHSGGLLFPPSYVPVVMSELSDQM 205 

Qy 196 DFWGRVKN— FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDF 253 

| | | | I : : : : I I : I : : I I I I I : I I : : I I : : 

Db 2 06 TFIERVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYW 262 

Qy 254 AFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEI 312 

II II I I I : I I I I I I I I : I : : : I I : I : : I I : : I I I I I I I : I 
Db 2 63 DFQFPHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE--ER 320 

Qy 313 FKEMNNAFAHLPQGVIWKCQCSHWPKDVliLAANVKI VT)WLPQSDLLM 372 

: : I I : I I I : I : : I I I : : I : I I : I I I I I I I : I I I I 
Db 321 ANVI ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWI PQNDLLGHPKTRAFITHGGA 377 

Qy 373 NSIMEAIQHGVPMVGIPLFGDQPENMVWEAKKFGVSIQLKK^ 4 32 

I I I I I I I : I I I I : I I I I I : I : : : I I I I : : : I : I : : I 

Db 378 NGIYEAIYHGIPMVGVPLLADQPDNIAHMKAKG7UVVSLDFHTMSSTDLLNALKTVINDPL 437 

Qy 433 YKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFL 492 

II I : II I : I I I I I : I : : MM: I : : II II 

Db 438 YKENAMKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFL 497 



Qy 493 LGLTLGTLWLCGKLLGMAVW-WLRGARKVK 521 

I : : : II I I : : I Ml 

Db 498 LAC VATVT FI ITKCL- FCVWKFVRTGKKGK 526 



RESULT 11 
A35366 

glucuronosyltransf erase (EC 2.4.1.17) UDPGTh-2 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 17-Aug-1990 #sequence_revision 17-Aug-1990 #text_change 29-Sep-1999 
C;Accession: A35366 

R;Ritter, J.K.; Sheen, Y.Y.; Owens, I.S. 
J. Biol. Chem. 265, 7900-7906, 1990 

A;Title: Cloning and expression of human liver UDP-glucuronosyltransf erase in 

COS-1 cells. 3,4-Catechol estrogens and estriol as primary substrates. 

A; Reference number: A35366; MUID: 90243659; PMID:2159463 

A;Accession: A35366 

A; Status : preliminary 

A; Molecule type: mRNA 

A; Residues: 1-529 <RIT> 

A;Cross-references: GB:J05428; NID : g340079 ; PIDN: AAA367 93 . 1 ; PID:g340080 
C; Genetics : 

A; Gene: GDB : UGT2B7 ; UGT2B9 

A; Cross-references: GDB: 5892203; OMIM: 600218 

A; Map position: 4ql3-4ql3 

C ; Super family : glucuronos yltrans f erase 

C;Keywords: glycosyltransf erase ; hexosyltransf erase; transmembrane protein 

Query Match 23.6%; Score 654.5; DB 2; Length 529; 

Best Local Similarity 31.5%; Pred. No. 2.3e-43; 

Matches 163; Conservative 88; Mismatches 212; Indels 55; Gaps 14; 



Qy 


34 


SHYLLMDRVSQILQDHGHNVTMLNHKRG-PFMPD FKKEEKSY Q 

II:::: 1 1 1 1 1 : 1 1 h 1 1 : : : 1 

SHWMNIKTILDELIQRGHEVTVLASSASILFDPNNSSALKIEIYPTSLTKTELENFIMQQ 


75 


Db 


34 


93 


Qy 


76 


VISWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDS 


133 


Db 


94 


: | | : | : | : : : : : : : : : I : : 1 1 
IKRWSDLPKD TFWLYFSQV QEIMSIFGDITRKFCKDWSNKKFMKK 


139 


Qy 


134 


LKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTS FGSLEFGLPI PLS YVPVFRS 

:: II:: : 1 1 : 1 1 III III 1 1 1 1 II 1 1 1 

VQESRFDVIFADAIFPCSELIAELFNIPFVYSLSFSPGYTFEKHSGGFIFPPSYVPWMS 


189 


Db 


140 


199 


Qy 


190 


LLTDHMDFWGRVKN- - FLMFFS FCRRQQHMQST FDNT I KEHFT E GSRPVLSHLLLKA 

III 1 1 MM 1 II : 1 1 1 1 : 1 1 
ELTDQMT FMERVKNMI YVL YFDF WFEI FDMKKWDQFYSEVLGRPTTLSETMGKA 


244 


Db 


200 


253 


Qy 


245 


ELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV 

: : 1 1 : : 1 1 IMM : 1 1 1 1 I I I I : I : : : I : 1 : 1 : : 1 1 : : 1 1 1 1 1 

DVWLIRNSWNFQFPHPLLPNVDFVGGLHCKPAKPLPKEMEDFVQSSGENGVWFSLGSMV 


304 


Db 


254 


313 


Qy 


305 


NTCQNPEIFKENmNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIR 
: I : :| | :|| |:|: : 1 1 1 :: hlhlll II 1 
SN-MTEERANVTASALAQI PQKVLWRFDGN KPDTLGLNTRLYKWI PQNDLLGHPKTR 


364 


Db 


314 


369 


Qy 


365 


LFVTHGGQNSIME^IQHGVPMVGIPLFGDQPENMVT^VTIAKKFGVSIQLKKLKAETLA^ 

1 : 1 1 1 1 1 1 1 1 1 1 1 : 1 1 II 1 1 1 1 1 1 1 : 1 : : : 1 : 1 : : : 1 • 

AFITHGGANGIYEAI YHGIPMVGIPLFADQPDNIAHMKARGAAVRVDFNTMSSTDLLNAL 


424 


Db 


370 


429 



425 KQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQY 484 

|::: I II : I I |: I | | -| I : I : : Mil: I : : 

430 KRVINDPSYKENVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYH 489 

485 LFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

II III :: : I I : I I I h 

490 SLDVIGFLLVCVATVIFIVTKCCLFCFW — KFARKAKK 525 



RESULT 12 
S15089 

glucuronosyltransf erase (EC 2.4.1.17) - rat 
C; Species: Rattus norvegicus (Norway rat) 

C;Date: 21-Nov-1993 #sequence_revision 10-Nov-1995 #text_change 20-Jun-2000 
C;Accession: S15089 

R;Lazard, D.; Zupko, K. ; Poria, Y. ; Nef, P.; Lazarovits, J.; Horn, S-; Khen, M. ; 
Lancet , D. 

Nature 349, 790-793, 1991 

A;Title: Odorant signal termination by olfactory UDP glucuronosyl transferase. 
A; Reference number: S15089; MUID : 91156050; PMID: 1900353 
A; Accession: S1508 9 
A;Molecule type: mRNA 
A; Residues: 1-527 <LAZ> 

A; Cross-references: GB:X57565; NID:g57762; PIDN : CAA4 0797 . 1 ; PID:g3980217 

C;Superfamily: glucuronosyltransf erase 

C;Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.5%; Score 650; DB 2; Length 527; 

Best Local Similarity 31.2%; Pred. No. 5.2e-43; 

Matches 166; Conservative 87; Mismatches 205; Indels 74; Gaps 12; 



Qy 


7 


LLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP- 

| | : I | 1 : I 1 : 1 1 1 : 1 : : 1 1111 = 1 1 ' 
LSLLGMSLGGNVL IWPMEGSHWLNVKIIIDELLRKEHNVTVLVASGALFITP 


65 


Db 


12 


63 


Qy 


66 


DFKKEE — KSYQVISWL APEDHQREFKKS FDFFLEETLGG 

Ml: | : | : : I 1 : 1 : 1 : II 
SVSPSLTFEIYPVPFGKEKIESVIKDF-VLTWLENRPSPSTIWTFYKEMAKVIEEFHLVS 


103 


Db 


64 


122 


Qy 


104 


RGKFENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFV 
|| | | : : | | : | : : : : : 1 : : 1 1 1 1 1 1 : 
RG ICDGVLKNEKLMTKLQRGKFEVLLSDPVFPCGDIVALKLGIPFI 


163 


Db 


123 


168 


Qy 


164 


AILSTSFGSLEFG LPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCR 

M 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 : : 1 

Y SLRFSPASTVEKHCGKVPFPPSYVPAILSELTDQMSFADRVRNFISY R 


212 


Db 


169 


217 


Qy 


213 


RQQHMQSTFDNTIKEHFTE--GSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGG 

| : | | : :: : | I : 1 1 1 : 1 : : : 1 : 1 1 1 1 1 1 : 1 II 
MQDYMFETLWKQWDSYYSKALGRPTTLCETMGKAEIWLMRTYWDFEFPRPYLPNFEFVGG 


270 


Db 


218 


277 


Qy 


271 


LMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWK 

| | | | | : | : : : | | : I : 1 1 : : 1 1 1 1 1 1 : : I I : I I I : 1 : 

LHCKP7VKPLPKEMEEFVQTSGEHGVWFSLGSMVKNL-TEEKANLIASALAQIPQKVLWR 


330 


Db 


278 


336 


Qy 


331 


CQCSHWPKDVIILAANVKIVDWLPQSDLI^PSIRLFVTHGGQNSIMEAIQHGVPMVGIPL 


390 



Db 



337 YK GKIPATLGSNTRLFDWIPQNDLLGHPKTRAFITHGGTNGIYEAIYHGIPMVGVPM 393 



Qy 391 FGDQ P ENMVRVEAKKFGVS I QLKKLKAET LALKMKQ I ME D KR YK S AAVAAS VI LRS H P L S 450 

| | | | : | : : : | | | : : : : I : : : : : II I : II I 2 

Db 394 FADQ P DN I AHMKAKGAAVEVNMNTMT S ADLL S AVRAVI N E P F YKENAMRL S RI HHDQ P VK 453 

Qy 451 PTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL 502 

I I I I I : I : : MM: I : : II I I I = I 

Db 454 PLDRAVFWI EFVMRHKGAKHLRVAAHDLSWFQYHSLDVI GFLLACMASAI LL 505 



RESULT 13 
A48633 

glucuronosyltransf erase (EC 2.4.1.17) precursor - human 
N; Alternate names: dihydrotestosterone/androstanediol UDP- 
glucuronosyltransf erase isoform 3, udpgth-3; UDP glucuronosyltransf erase 2 
family, protein B15 
C; Species: Homo sapiens (man) 

C;Date: 07-Apr-1994 #sequence_revision 18-Nov-1994 #text_change 29-Sep-1999 
C;Accession: A48633; 138559 

R;Chen, F. ; Ritter, J.K.; Wang, M.G.; McBride, O.W. ; Lubet, R.A. ; Owens, I.S. 
Biochemistry 32, 10648-10657, 1993 

A;Title.: Characterization of a cloned human dihydrotestosterone/androstanediol 
UDP-glucuronosyltransf erase and its comparison to other steroid isoforms. 
A; Reference number: A48633; MUID : 94002056 ; PMID: 8399210 
A;Accession: A48633 

A; Status: preliminary; not compared with conceptual translation 

A;Molecule type: nucleic acid 

A; Residues: 1-530 <CHE> 

A; Experimental source: liver 

A;Note: sequence extracted from NCBI backbone (NCBIP : 138786) 
R;Green, M.D.; Oturu, E.M.; Tephly, T.R. 
Drug Metab. Dispos . 22, 799-805, 1994 

A;Title: Stable expression of a human liver UDP-glucuronosyltransf erase 
(UGT2B15) with activity toward steroid and xenobiotic substrates. 
A;Reference number: 138559; MUID : 95136867 ; PMID:7835232 
A;Accession: 138559 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-530 <RES> 

A; Cross-references: EMBL : U08854 ; NID:g475758; PIDN : AAC50077 . 1 ; PID:g475759 
C; Genetics: 

A; Gene: GDB : UGT2B15 ; UGT2B8 

A;Cross-references: GDB: 5892418; OMIM: 600219 

A; Map position: 4ql3-4ql3 

C; Superf amily : glucuronosyltransf erase 

C; Keywords : glycosyltransf erase; hexosyltransf erase 

Query Match 23.4%; Score 649; DB 2; Length 530; 

Best Local Similarity 31.8%; Pred. No. 6.3e-43; 

Matches 155; Conservative 90; Mismatches 218; Indels 24; Gaps 10; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVI-SWLAPEDHQREFKKS 92 

I I : : I : : I I I I I : I : I : I : I I ' I 

Db 34 SHWINMKTILEELVQRGHEVTVLTSSASTLVNASKSSAIKLEVYPTSLTKNDLEDSLLKI 93 



Qy 



93 FDFFLEETLGGRGKFENLLNVLEYLALQ 



CSHFLNRKDIMDSLKNENFDMVIV 14 4 



Db 


94 


LDRWIYGV — SKNTFWSYFSQLQELCWEYYDYSNKLCKDAVLNKKLMMKLQESKFDVILA 


151 


Qy 


145 


ETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLP I P LS YVPVFRS LLTDHMD FWGR 

: : I I : 1 I 1 1 : 1 1 1 : 1 1 1 1 II 1 1 1 hi M 1 

DALNPCGELLAELFNIPFLYSLRFSVGYTFEKNGGGFLFPPSYVPWMSELSDQMIFMER 


200 


Db 


152 


211 


Qy 


201 


VKNF— LMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 

: | | : : : 1 1 : : : : 1 1 1 1 : I I 1 : 1 1 : : 1 : 1 

IKNMIHMLYFDFWFQIYDLKK-WDQFYSE--VLGRPTTLFETMGKAEMWLIRTYWDFEFP 


258 


Db 


212 


268 


Qy 


259 


RPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 

I I 1 1 1 : 1 1 1 1 1 1 1 1 : 1 : : : 1 1 : I : : I I : : 1 1 1 1 : : : 1 : : 

RPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGIWFSLGSMISN-MSEESANMIAS 


318 


Db 


269 


327 


Qy 


319 


AF7\HLPQGVIWKCQCSHWPKDVHIJVANVKIVDWLPQSDLIJ^PSIRLFVTHGGQNSIM^ 

| | : | | | : | : 1 1 : 1 :: 1 1 1 1 : 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 

ALAQIPQKVLWRFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEA 


378 


Db 


328 


384 


Qy 


379 


IQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETIALKMKQIM^ 

1 ||:| II III 1 1 II :|: ::|| :|: :: : : 1 :| :: 1 II ' 


438 


Db 


385 


IYHGIPMVGIPLFADQHDNIAHMKAKGAALSVDIRTMSSRDLLNALKSVINDPVYKENVM 


444 


Qy 


/ion 


7\7\QWTT DQHDT QPTTlRT \/TWT fiHVT OTf^ ATHT ,KP YVFOOPWHF.OYLFDVFVFLLGLTLG 
J\t\o V _L J_i.Korl.tr J_iD Jr 1 vr\J_j v J_ un v ±j\j J. vjvj-AI o j_ix\ it x vc SdSi^ vv jjjl u v ±. v l ujjuui __i»_> 

|| I : I | | | I : I : : MM: 1 : : II III 

KLSRIHHDQPMKPLDRAVI^IEFVMRHKGAKHLRVAAHNLTWIQYHSLDVIAFLLACVAT 


498 


Db 


445 


504 


Qy 


499 


TLWLCGK 505 




Db 


505 


: : : 1 

VIFIITK 511 





RESULT 14 
JN0620 

UDP-glucuronosyltransf erase (EC 2.4.1.-) 2B-10 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 31-Dec-1993 #sequence_revision 31-Dec-1993 #text_change 28-May-1999 
C; Accession: JN0620 

R;Jin, C.J. ; Miners, J.O.; Lillywhite, K.J.; Mackenzie, P.I. 
Biochem. Biophys. Res. Commun. 194, 496-503, 1993 

A; Title: cDNA cloning and expression of two new members of the human liver UDP- 
glucuronosyltransf erase 2B subfamily. 

A; Reference number: JN0619; MUID: 93326164 ; PMID: 8333863 
A; Accession: JN0620 
A; Molecule type: mRNA 
A; Residues: 1-528 <JIN> 

A;Cross-references: GB:X63359; NID:g516149; PIDN: CAA44961 . 1; PID:g516150 

A; Experimental source: liver 

C; Superfamily: glucuronosyltransf erase 

C; Keywords: glycoprotein; glycosyltransf erase; hexosyltransf erase; transmembrane 
protein 

F; 1-23/Domain : signal sequence #status predicted <SIG> 

F;24-528/Product: UDP-glucuronosyltransf erase 2B-10 #status predicted <MET> 
F;491-508/Domain: transmembrane #status predicted <TMM> 

F;66,314,481/Binding site: carbohydrate (Asn) (covalent) ftstatus predicted 



Query Match 



23.2%; Score 642; DB 2; Length 528; 



Best Local Similarity 31.6%; Pred. No. 2.2e-42; 

Matches 167; Conservative 82; Mismatches 213; Indels 66; Gaps 13; 



Qy 31 VGGSHYLLMDRVSQI LQD HGHNVTMLNHKRGPFMPDFKKEEKSYQVISWLAPEDHQR 87 

I : I I : i I : : I I I I : I I : : II 

Db 27 VWAAEYS LWMNMKT I LKELVQRGHEVTVL ASSASIL — FDPNDSST 70 

Qy 88 EFKKSFDFFLEETLGGRGKFENLLNVL EYLALQ 120 

: : I : I : I I I : : I : : 

Db 71 LKLEVYPTSLTKT EFENIIMQLVKRLSEIQKDTFWLPFSQEQEILWAINDIIRNF 125 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FG 176 

I : : I : I I : I I : I : : I I : I I I I I I I I I I I 

Db 126 CKDWSNKKLMKKLQESRFDIVFADAYLPCGELLAELFNIPFVYSHSFSPGYSFERHSGG 185 

Qy 177 LPIPLSYVPVFRSLLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSR 234 

I I I I I I I I : I I I I I I I : : : : I I : : I : : I I I 
Db 186 FIFPPSYVPWMSKLSDQMTFMERVKNMLYVLYFDFWFQIFNMKK-WDQFYSE--VLGRP 242 

Qy 235 PVLSHLLLKAELWFINSDFAFDF7VRPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSG 294 

II : I I : : I : : : I I I I I I : I I I I I I I I : I : : : I I : I : : I 
Db 243 TTLSETMRKADIWLMRNSWNFKFPHPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENG 302 

Qy 295 FVLWLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQ 354 

I : : I I I I I : I : I I : I I I : I : : I I I : : I : I I 

Db 303 VWFSLGSMVSN-MTEERANVIATALAKIPQKVXWRFDGN---KPDALGLNTRLYKWIPQ 358 

Qy 355 SDL LAH P S I RL FVT H G GQN S I MEAI QH G VPMVG IPLFGDQP ENMVRVE AK K FGVS I Q L KK 414 

: I I I II I I : I I I I I I I I I I I : I I I I I I I I I I I : I : : : I I I : 
Db 359 NDLLGHPKTRAFITHGGANGIYEAIYHGIPMVGIPLFFDQPDNIAHMKAKGAAVRVDFNT 418 

Qy 415 LKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPY 474 

: : I : I : : I II : II I : I I I I I : I : : I I I I : 

Db 419 MSSTDLLNALKTVINDPSYKENIMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRVA 47 8 

Qy 475 VFQQPWHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKE 522 

I : : II III I : : I I : I I I I : 

Db 479 AHNLTWFQYHSLDVIGFLLACVATVLFIITKCCLFCFW — KFARKGKK 524 



RESULT 15 
S07390 

glucuronosyltransf erase (EC 2.4.1.17) 3 precursor - rat 

N;Alternate names: 17beta-hydroxysteroid UDP-glucuronosyltransf erase ; UDP- 

glucuronosyltransf erase isoform 50K 

C; Species: Rattus norvegicus (Norway rat) 

C;Date: 30-Sep-1991 #sequence_revision 30-Sep-1991 #text_change 07-May-1999 
C;Accession: S07390; A33236; A28460; S59626 

R;Harding, D. ; Wilson, S.M.; Jackson, M.R.; Burchell, B. ; Green, M.D.; Tephly, 
T.R. 

Nucleic Acids Res. 15, 3936, 1987 

A; Title: Nucleotide and deduced amino acid sequence of rat liver 17beta — 

hydroxysteroid UDP-glucuronosyltransf erase . 

A; Reference number: S07390; MUID : 87231096; PMID: 3108864 

A; Accession: S07390 

A;Molecule type: mRNA 

A; Residues: 1-530 <HAR> 



A/Cross-references : EMBL: Y00156 

A; Experimental source: liver 

A;Accession: A33236 

A;Molecule type: protein 

A; Residues: 24-61 <HAR2> 

A; Experimental source: liver 

R;Mackenzie, P.I. 

J. Biol. Chem. 262, 9744-9749, 1987 

A;Title: Rat liver UDP-glucuronosyltransf erase . Identification of cDNAs encoding 
two enzymes which glucuronidate testosterone, dihydrotestosterone, and beta- 
estradiol . 

A;Reference number: A28460; MUID : 87250645 ; PMID:3110162 
A; Accession: A284 60 
A;Molecule type: mRNA 

A; Residues: 1-118, 1 G 1 , 120-240, ' L 1 , 242-423, 'S', 425-499, 1 T ' , 501-530 <MAC> 
A; Experimental source: liver 

R;Yamashita, A.; Watanabe, M. ; Tonegawa, T.; Sugiura, T. ; Waku, K. 
Biochem. J. 312, 301-308, 1995 

A;Title: Acyl-CoA binding and acylation of UDP-glucuronosyltransf erase isoforms 

of rat liver: their effect on enzyme activity. 

A; Reference number: S59626; MUID: 96077159; PMID:7492328 

A/Accession: S59626 

A;Molecule type: protein 

A; Residues: 24-44 <YAM> 

A; Experimental source: liver 

C; Superfamily : glucuronosyltransf erase 

C; Keywords: glycosyltransf erase; hexosyltransf erase; transmembrane protein 
F; 1-23/Domain: signal sequence #status predicted <SIG> 
F;24-530/Product : glucuronosyltransf erase #status experimental <MAT> 
F; 494-510/Domain: transmembrane #status predicted <TMM> 

Query Match 23.1%; Score 640; DB 2; Length 530; 

Best Local Similarity 34.2%; Pred. No. 3.2e-42; 

Matches 167; Conservative 81; Mismatches 189; Indels 52; Gaps 15; 

Qy 34 S H YLLMDRVS Q I LQ DHGHN VT MLNHKRGPFMPDFKKEEKSYQVISWLAPEDH 85 

||:::: I I I I I : I : I : I I I I I : : : 

Db 34 SHWMNIKTILDELVQRGHEVTVLKPSAYYVLDPKKS — PDLKFETFPTSV SKDEL 86 

Qy 86 QREFKKSFDFFLEE TLGGRGKFENLLNVLE — YLALQCSHFLNRKDIMDSLKNEN 138 

: I I I : I I : I : : : I I : I I : : I : I I : 

Db 87 ENYFIKLVDVWTYELQRDTCLSYSPLLQNMIDEFSDYYLSL-CKDTVSNKQLMAKLQESK 145 

Qy 139 FDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG SLEFGLPI PLS YVPVFRSLLT 192 

II:::: I I II I I I I : I II I II I I I I I I I : 

Db 14 6 FDVLLSDPVAACGELIAEVLHI PFLYSLRFSPGYKI EKSSGRFIL — PPSYVPVILSGMG 203 



Qy 193 DHMD FWGRVKN FL — MFFS FCRRQQHM — QSTFDNTIKEHFTEGSRPVLSHLLLKAELWF 24 8 

I I I I I I : : : I I II : I III I : : I I I : I 

Db 204 GPMTFIDRVKNMICTLYFDF WFHMFNAKKWDPFYSEI F — GRPTTLAETMGKAEMWL 258 

Qy 24 9 INSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQ 308 

II: : I I I I I 1:111 : I I I : I : I : I : I : I : I I : : I I I I I : : 
Db 259 IRSYWDLEFPHPTLPNVDYIGGLQCRPPKPLPKDMEDFVQSSGEHGVWFSLGSMVSS — 316 

Qy 309 NPEIFKEMNN AFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIR 364 

: : I I I I : I I I : I I I I : : I I I I : I I I I I : 



Db 317 MTEEKANAIAWALAQI PQKVLWKFD GKTPATLGPNTRVYKWLPQNDLLGHPKTK 370 

Qy 365 LFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKTVETIJVLm 424 

I I I I I I I : I I I I I : I I I I I I : I I : I : I : : I I I : : : : : I : 

Db 371 AFVTHGGANGVYEAIYHGIPMVGIPMFGEQHDNIAHMVAKGAAVTLNIRTMSKTDLFNAL 430 

Qy 425 KQ I ME D KR YK S AAVAAS VILRSHPLSPTQ RL VGW I DH VLQT GGAT H L K P YVFQQ P WH EQ Y 484 

I : I : : II II II hi : I I I : I : : I I I I : I II-- : 

Db 4 31 KEIINNPFYKKNAVWLSTIHHDQPMKPLDKAVFWIEFVMRHKGAKHLRPLGHDLPWYQYH 490 

Qy 4 85 LFDVFVFLL 4 93 

II III 

Db 4 91 SLDVIGFLL 4 99 



Search completed: May 7, 2004, 17:33:21 
Job time : 22 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



May 7, 2004, 17:31:25 



Search time 4 8 Seconds 
(without alignments) 
3024.321 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



Searched: 



US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS . 



. GKLLGMAWWLRGARKVKET 523 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1140673 seqs, 277566755 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1140673 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB. pep : * 

2: /cgn2_6/ptodata/2/pubpaa/PCT_NEW_PUB.pep: * 

3: /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB.pep:* 

4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB . pep : * 

5: /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep:* 

6: /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB.pep:* 

7 : /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep: * 

8 : /cgn2_6/ptodata/2/pubpaa/US08_PUBCOMB . pep : * 

9: /cgn2_6/ptodata/2/pubpaa/US09A_PUBCOMB.pep:* 
10 : /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB .pep : * 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12: /cgn2_6/ptodata/2/pubpaa/US09_NEW_PUB.pep:* 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB . pep : * 
14: /cgn2_6/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 
15: /cgn2_6/ptodata/2/pubpaa/USlOC_PUBCOMB.pep: * 
16: /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB.pep:* 
17: /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB.pep:* 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBCOMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



% 

Query 

Match Length DB 



ID 



Description 



1 


2768 


100. 


0 


523 


10 


us- 


09- 


946- 


374-282 


Sequence 


282, 


App 


2 


2768 


100. 


0 


523 


12 


us- 


10- 


006- 


485A-282 


Sequence 


282, 


App 


3 


2768 


100. 


0 


523 


12 


us- 


10- 


013- 


907A-282 


Sequence 


282, 


App 


4 


2768 


100. 


0 


523 


12 


us- 


10- 


015- 


499A-282 


Sequence 


282, 


App 


5 


2768 


100. 


0 


523 


12 


us- 


10- 


013- 


910A-282 


Sequence 


282, 


App 


6 


2768 


100. 


0 


523 


12 


us- 


10- 


226- 


254A-282 


Sequence 


282, 


App 


7 


2768 


100. 


0 


523 


12 


us- 


10- 


015- 


395A-282 


Sequence 


282, 


App 


8 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


856A-282 


Sequence 


282, 


App 


9 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


818A-282 


Sequence 


282, 


App 


10 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


393A-282 


Sequence 


282, 


App 


11 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


869A-282 


Sequence 


282, 


App 


12 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


121A-282 


Sequence 


282, 


App 


13 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


116A-282 


Sequence 


282, 


App 


14 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


117A-282 


Sequence 


282, 


App 


15 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


527A-282 


Sequence 


282, 


App 


16 


2768 


100. 


0 


523 


14 


us- 


10- 


013- 


913A-282 


Sequence 


282, 


App 


17 


2768 


100. 


0 


523 


14 


us- 


10- 


007- 


194A-282 


Sequence 


282, 


App 


18 


2768 


100. 


0 


523 


14 


us- 


10- 


013- 


430A-282 


Sequence 


282, 


App 


19 


2768 


100. 


0 


523 


14 


us- 


10- 


011- 


671A-282 


Sequence 


282, 


App 


20 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


755A-282 


Sequence 


282, 


App 


21 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


38 6A-2 82 


Sequence 


282, 


App 


22 


2768 


100. 


0 


523 


14 


us- 


10- 


011- 


692A-2 82 


Sequence 


282, 


App 


23 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


768A-282 


Sequence 


282, 


App 


24 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


610A-282 


Sequence 


282, 


App 


25 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


063A-282 


Sequence 


282, 


App 


26 


2768 


100. 


0 


523 


14 


us- 


10- 


020- 


063A-282 


Sequence 


282, 


App 


27 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


391A-282 


Sequence 


282, 


App 


28 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


407A-282 


Sequence 


282, 


App 


29 


2768 


100. 


0 


523 


14 


us- 


10- 


011- 


833A-282 


Sequence 


282, 


App 


30 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


041A-282 


Sequence 


282, 


App 


31 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


822A-282 


Sequence 


282, 


App 


32 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


387A-282 


Sequence 


282, 


App 


33 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


130A-282 


Sequence 


282, 


App 


34 


2768 


100. 


0 


523 


14 


us- 


10- 


006- 


172A-282 


Sequence 


282, 


App 


35 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


253A-282 


Sequence 


282, 


App 


36 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


392A-282 


Sequence 


282, 


App 


37 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


306A-282 


Sequence 


282, 


App 


38 


2768 


100. 


0 


523 


14 


us- 


10- 


017- 


867A-282 


Sequence 


282, 


App 


39 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


064A-282 


Sequence 


282, 


App 


40 


2768 


100. 


0 


523 


14 


us- 


10- 


013- 


909A-282 


Sequence 


282, 


App 


41 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


671A-282 


Sequence 


282, 


App 


42 


2768 


100. 


0 


523 


14 


us- 


10- 


015- 


610A-282 


Sequence 


282, 


App 


43 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


137A-282 


Sequence 


282, 


App 


44 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


752A-282 


Sequence 


282, 


App 


45 


2768 


100. 


0 


523 


14 


us- 


10- 


012- 


754A-282 


Sequence 


282, 


App 



ALIGNMENTS 



RESULT 1 

US-09-946-374-282 

; Sequence 282, Application US/09946374 

; Publication No. US20030073129A1 

; GENERAL INFORMATION: 

; APPLICANT: Baker, Kevin P. 



APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan L. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 
APPLICANT: Roy, Margaret Ann 
APPLICANT: Smith, Victoria 
APPLICANT: Stewart, Timothy A. 
APPLICANT: Tumas, Daniel 
APPLICANT: Watanabe, Colin K. 
APPLICANT: Williams, P. Mickey 
APPLICANT: Wood, William I. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C1 

CURRENT APPLICATION NUMBER: US/09/946, 374 
CURRENT FILING DATE: 2001-09-04 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098 843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099602 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099642 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099741 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099754 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099763 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099792 
PRIOR FILING DATE: 1998-09-10 • 



PRIOR APPLICATION NUMBER: 60/099808 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099812 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099815 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099816 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/100385 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100388 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100390 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100584 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 
PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 



PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102484 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102487 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103401 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 



PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/104257 
PRIOR FILING DATE: 1998-10-14 
PRIOR APPLICATION NUMBER: 60/104987 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105000 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105002 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105104 
PRIOR FILING DATE: 1998-10-21 
PRIOR APPLICATION NUMBER: 60/105169 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105266 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105693 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105694 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/1058 07 



Query Match 100.0%; Score 2768; DB 10; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 




I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 




1 1 1 1 1 Ill 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 




1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 M 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 


Qy 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKD 


360 






1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 




Db 


301 


GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV11LAANVKIVDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 




1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 ii i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 




Db 


361 


PS I RLFVTHGGQNS IMEAIQHGVPMVGI PLFGDQPENMVRVEAKKFGVS IQLKKLKAETL 


420 



Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I I I II II I I I I I I I I I I I I I I 

Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

II I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I II I II I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 2 

US-10-006-485A-282 

Sequence 282, Application US/10006485A 
Publication No. US20030064062A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C9 

CURRENT APPLICATION NUMBER: US/10/006, 485A 
CURRENT FILING DATE: 2001-12-06 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099602 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099642 



PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099741 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099754 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099763 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099792 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099808 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099812 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099815 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099816 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/100385 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100388 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100390 
PRIOR . FILING DATE : 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100584 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 



PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR. APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102484 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102487 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 



PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103401 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/104257 
PRIOR FILING DATE: 1998-10-14 
PRIOR APPLICATION NUMBER: 60/104987 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105000 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105002 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105104 
PRIOR FILING DATE: 1998-10-21 
PRIOR APPLICATION NUMBER: 60/105169 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105266 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105693 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105694 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105807 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105881 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105882 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/106023 
PRIOR FILING DATE: 1998-10-28 

Query Match 100.0%; Score 2768; DB 12; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


lb 1 


LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLbnL 


Z ft U 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 








LLKAhLWr INSDFAFDFAKPLLrIM 1 VYVGGLMEKPIKPVPQDLENFIAKFGDSGrVLVl L 


o n n 
3UU 


Qy 


301 


GSIWNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GSIWNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVH 


o c c\ 

360 


Qy 


361 


PSIRLFVTHGGQNSIME^\IQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


P S I RL FVT H G GQN S IMEAI QH GVPMVG I P L FG DQ P ENMVRVEAKK FGVS I Q LKKLKAET L 


420 


Qy 


421 


ALKMKQ I ME D KR YK S AAVAAS VI L RS H P L S P TQ RLVGW I DHVLQT G GATH L KP YVFQQ PW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 3 

US-10-013-907A-282 

Sequence 282, Application US/10013907A 
Publication No. US20030064925A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C34 

CURRENT APPLICATION NUMBER: US/ 10/013 , 907A 
CURRENT FILING DATE: 2001-12-10 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-013-907A-282 



Query Match 100.0%; Score 2768; DB 12; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 



0; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 18 0 

I i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M T 1 1 

Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSH^ 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDW 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMWVEAJ<KFGVSIQLKKLK7VETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 AL KMKQ IME D K R YK S AAVAAS VI L RS H P L S PT QRLVGW I DHVLQT GGAT HL K P YVFQQ PW 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKP YVFQQ PW 480 

Qy 4 81 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I 
Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 4 

US-10-015-499A-282 

Sequence 282, Application US/10015499A 
Publication No. US20030065142A1 
GENERAL INFORMATION: 
APPLICANT : Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 



APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C42 

CURRENT APPLICATION NUMBER: US/10/015, 499A 
CURRENT FILING DATE: 2 001-12-11 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-499A-282 

Query Match 100.0%; Score 2768; DB 12; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

MAGQRVLLLVGFLLPGVLLS EAAKI LT I STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I Mil-Ill 

MAGQRVLLLVGFLLPGVLLS EAAKI LT I STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKR 6 0 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I M I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I | | | | | | | | I I I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

LLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

GSMVNTCQNPEI FKEMNNAFAHLPQGVIWKCQCSHWPKD VHLAANVKI VDWLPQSDLLAH 360 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV1ILAANVKIVDWLPQSDLLAH 360 

PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
P S I RL FVTHGGQN S I MEAI QHGVPMVG I P L FGDQ P ENMVRVEAKK FGVS I QLKKLKAET L 42 0 

ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 48 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





1 


Db 


1 


Ov 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 



RESULT 5 

US-10-013-910A-282 

Sequence 282, Application US/10013910A 
Publication No. US20030187192A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C33 

CURRENT APPLICATION NUMBER: US/10/013, 910A 
CURRENT FILING DATE: 2001-12-10 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-013-910A-282 



Query Match 100.0%; Score 27 68; DB 12; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


301 


Db 


301 


Qy 


361 


Db 


361 


Qy 


421 


Db 


421 


Qy 


481 


Db 


481 



MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I 

MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

M I I I I II I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 18 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

| | I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I 

LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

GSl^TCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 
| | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 



PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

| | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 42 0 

ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 48 0 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 480 

HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I 

HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 6 

US-10-226-254A-282 

Sequence 282, Application US/10226254A 
Publication No. US20030224478A1 
GENERAL INFORMATION: 
APPLICANT: Baker , Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 



; TITLE OF INVENTION: Acids Encoding the Same 
; FILE REFERENCE: P2830P1C68 

; CURRENT APPLICATION NUMBER: US/10/226, 254A 
; CURRENT FILING DATE: 2002-08-21 

PRIOR APPLICATION NUMBER: 60/098716 
; PRIOR FILING DATE: 1998-09-01 
; PRIOR APPLICATION NUMBER: 60/098723 
; PRIOR FILING DATE: 1998-09-01 
; PRIOR APPLICATION NUMBER: 60/098749 
; PRIOR FILING DATE: 1998-09-01 
; PRIOR APPLICATION NUMBER: 60/098750 
; PRIOR FILING DATE: 1998-09-01 
; PRIOR APPLICATION NUMBER: 60/098803 
; PRIOR FILING DATE: 1998-09-02 
; PRIOR APPLICATION NUMBER: 60/098821 
; PRIOR FILING DATE: 1998-09-02 
; PRIOR APPLICATION NUMBER: 60/098843 
; PRIOR FILING DATE: 1998-09-02 
; PRIOR APPLICATION NUMBER: 60/099536 
; PRIOR FILING DATE: 1998-09-09 
; PRIOR APPLICATION NUMBER: 60/099596 
; PRIOR FILING DATE: 1998-09-09 
; PRIOR APPLICATION NUMBER: 60/099598 
; PRIOR FILING DATE: 1998-09-09 

; Remaining Prior Application data removed - See File Wrapper or PALM. 
; NUMBER OF SEQ ID NOS : 477 

; SEQ ID NO 282 

LENGTH: 523 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-226-254A-282 

Query Match 100.0%; Score 2768; DB 12; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 




Db 



1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 



QY 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 




Db 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 



Qy 



121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 




Db 



121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Qy 



181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 




Db 



181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 



Qy 



Db 



241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 



Qy 



3 01 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTiLAANVKIVT)WLPQSDLLJ\H 



360 



Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHIJ^VKIVDWLPQSDLIAH 3 60 

Qy 361 P S I RL FVTHGGQNS IMEAI QHGVPMVGI PLFGDQP ENMVRVEAKKFGVS I QLKKLKAETL 420 

I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 PS I RLFWHGGQNS IMEAI QHGVPMVGI PLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

I I I I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 ALKMKQIMEDKRYKSAAV7VASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 7 

US-10-015-395A-282 

Sequence 282, Application US/10015395A 
Publication No. US20040073015A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C57 

CURRENT APPLICATION NUMBER: US/10/015, 395A 
CURRENT FILING DATE: 2001-12-12 

Prior application removed - See file Wrapper or Palm 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-395A-282 



Query Match 100.0%; Score 2768; DB 12; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 
Matches 523; Conservative 0; Mismatches 0; 



Length 52 3; 
Indels 0; 



Gaps 



0; 



Qy 

Db 



1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

1 MAGQRVLLLVGFLLPGVLLSEAi\KILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 



Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 



Db 



61 



I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I M I I I I 

GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 



Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI7VKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHIAANVKIVDWLPQSDLLAH 360 

I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSMVNTCQNPEIFKElvntfNAFAHLPQGV^ 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVET^KKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 8 

US-10-006-856A-282 

Sequence 282, Application US/10006856A 
Publication No. US20030044841A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C14 

CURRENT APPLICATION NUMBER: US/10/006, 856A 
CURRENT FILING DATE: 2002-05-10 
NUMBER OF SEQ ID NOS: 477 



; Prior Application removed - See File Wrapper or Palm 
; SEQ ID NO 282 

LENGTH: 523 

TYPE : PRT 
; ORGANISM: Homo sapiens 
US-10-006-856A-282 



Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MAGQ RVLL LVG FL L P GVL L S EAAK I LT I S T VGG S H YL LMDRVS Q I LQ DH GHN V 1 MLN ti KK 


OU 






i i i i i i i i i i i i i i i i i f i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M M II 1 II 1 1 1 1 II ! 




Db 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


ion 




■ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


ion 

loU 




. i ■ ■ ill' iiitillllltlll lllllillllll 1 1 1 1 i 1 1 t 1 1 1 1 1 1 1 1 i 1 I t 1 1 

1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


181 


LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


Z 4 U 




i i i • i i ■ i i i i i i i i i i i i i i i i i i i i t > i i i i i i i i i i t i i i i i i 

I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVx VGGLMhKPlKFVFyuijli.JN t lAJ\r <aDo^r vliV 1 li 


ofin 




i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i \ \ i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDr ARPLLPN I V YVGGLMhKrlKr VryjJlirjJNr lAJ\r (jJJotjr VliV 1 J_i 


OAA 


Qy 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVliL7W\NVKIVDWLPQ 


360 






I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 




1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 


Qy 


421 


ALKMKQIMEDKRYKSAAV7UVSVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 






I I I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


7VLKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 9 

US-10-006-818A-2 82 

; Sequence 282, Application US/10006818A 

; Publication No. US20030054406A1 

; GENERAL INFORMATION: 

; APPLICANT: Baker, Kevin P. 

; APPLICANT: Botstein, David 

; APPLICANT: Desnoyers, Luc 

; APPLICANT: Eaton, Dan 1. 



APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C4 

CURRENT APPLICATION NUMBER: US/10/006, 818A 
CURRENT FILING DATE: 2001-12-06 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS: 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-006-818A-282 

Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6. 6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I II I I 

Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 30 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI7VKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 

I I I I I I I I I I I I I I I I I I I II I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVliLAANVKIVDWLPQSDLLAH 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 42 0 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 



QY 



421 



ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 
I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



480 



Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 10 

US-10-015-393A-282 

Sequence 282, Application US/10015393A 
Publication No. US20030069179A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: .Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C46 

CURRENT APPLICATION NUMBER: US/ 10/015, 393A 
CURRENT FILING DATE: 2002-06-10 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-393A-282 

Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEA7VKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I | | | | | | | | I I I I i I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GPFMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I I I I II I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 



Qy 



181 LS YVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTI KEHFTEGSRPVLSHL 240 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 



Db 181 LSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 LLKAELWFINSDFAFDF7VRPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQS 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 301 GSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 

Qy 361 P S I RLFVTHGGQNS IMEAI QHGVPMVGI P LFGDQPENMVRVEAKKFGVS I QLKKLKAET L 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPWFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I II I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 11 

US-10-015-869A-282 

Sequence 282, Application US/10015869A 
Publication No. US20030073130A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C45 

CURRENT APPLICATION NUMBER: US/10/015 , 869A 
CURRENT FILING DATE: 2002-06-25 

Prior Application removed - See File Wrapper or Palm 
NUMBER OF SEQ ID NOS : 477 
SEQ ID NO 282 
LENGTH: 523 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-10-015-869A-282 



Query Match 

Best Local Similarity 



100.0%; 
100.0%; 



Score 2768; DB 14; Length 523; 
Pred. No. 6.6e-272; 



Matches 523; Conservative 0; Mismatches 



0; Indels 



0; Gaps 



0; 



Qy 


1 


MAGQRVXLLVGFLLPGVXLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 






i i i i ( i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I I I I l 1 1 11 1 1 1 1 
1 11 1 1! i II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II II II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 • ! 1 1 1 1 




DD 


L 


i v i/\LjyKVijJ_iij Vkde XjJjIt Kj V J_i.bo £i/\/\rv-L J_i 1 lol VLjLjoiI I ±jJ_i1 v 11JK V J. Li^s Un.ijn.ri v 1 i iJ_HMiii\rs. 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWIAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVT^EYLALQ 


120 






1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 I i 1 1 1 1 1 i 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 

1 II 1 1 1 1 1 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 M I I I i i i 




Dr> 


bl 


r<n TmjfDTWPi^ vur vo vni tt ct«tt a nununDn'Trvyc Tr'nTTTPT t?t?tt rr Df vttttmt t M"\ at FYT.ZVT.D 
Kdc r MirDr KxvcjCjKo IQ VI oWiiAriljDriyKtjr ivrvor Dr r IjEjEjI lj(j^KUJ\r ^NIjIjINJ VJ-iJj i j_lMJ_iv^ 


1 20 

JL C \J 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 






i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i ii i i I i I I 
1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l l l ll i i i i i i i 




DD 


i oi 


riQULT M DVTlTMnC T VMT?M t?r\M\ FT\ /"FT 1 TTTl Vf P T?T T 7AITT<rT C V P TT\ T T CJT 1 C! WC2 ^ T FFT^T.PT P 

Lorir J_iJN KrvD-LJYLDo LiiSJN £jin c JJJV1V JL VcjI e Uiv^rrr ijl/\Tjx\ijoi\r r v/\±1jo lor ^joi-iHii: ult _l z 


1 RO 

X U \J 


Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






i i i i i i i i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 




DD 


1 Q 1 


Lib I Vlr V r Kb J_iLil DniYiDr WLjKV J\JN r LilYlr r o r UKKyyai 1 r JJlN l ± ixtLirin l rjoor\ir v ±jon±j 




Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i I I I I I I I I I I I I I I I I I I I I I I t I 
I 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


O A 1 

241 


T T T/"7\TPTTVTT7 , TVra - nT7 1 7V tPrMT 1 A DDT T DKTTlTV^TTr'T UfC 7DT VDUOnTM T?M TTT A V" TPf Q f* TT\ TT \ FV T 

LLKAhjLWr IJM bUr Ar Dr AKrLLrJN 1 V x V(j(aLMrjJ\lr 1 rvir V ryDLirjJMr l/\i\r LrDoijr VLiVlLi 


J u u 


Qy 


-301 


.-GSMVNTCQNEEIEKEMNNAFAHLPQ 


360 






1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 I 1 1 1 1 
1 1 1 II 1 1 1 1 1 1 1 1 1 11 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 




Db 


301 


GSMVNTCQNPhl c KhMNNAr AnLPybVlWKCyL.oHWF 


jDU 


Qy 


361 


P S I RLFVTHGGQNS IMEAI QHGVPMVGI PLFGDQPENMVRVEAKKFGVS I QLKKLKAETL 


420 






i i i i i i i i I I I t I I I I I I I I I I I I I I l i l l l l l l l l l 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 
II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 




DD 




ncTDT TTA/TurrriMc;TM'l?7ATriMr'\/'Dl\A/'rT'DT T^r , nr^PT^MT\/r\rD'\/Tr , 7A'K P VWfZM^. TOT 'PfKT.K'B.RTT. 
c o X KliT V I noLiyjN o J_1Y1£jl/\-L \J rl Kdv if IYIVLtI r LtC ^»JJy r EjlNlYlVKVru/\J\J\r ovo X^±jrvJ\ijiYrtJiil J_i 


rt U 


Qy 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDrTVXQTGGATHLKPYVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 . 








1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 





RESULT 12 

US-10-012-121A-282 

Sequence 282, Application US/10012121A 
Publication No. US20030073810A1 
GENERAL INFORMATION: 
APPLICANT: Baker, Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers f Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 



; APPLICANT: Paoni, Nicholas F. 

; TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
; TITLE OF INVENTION: Acids Encoding the Same 
; FILE REFERENCE: P2830P1C20 

; CURRENT APPLICATION NUMBER: US/10/012, 121A 
; CURRENT FILING DATE: 2001-12-07 

; Prior Application removed - See File Wrapper or Palm 
; NUMBER OF SEQ ID NOS : 477 

; SEQ ID NO 282 
LENGTH: 523 
; TYPE: PRT 

; ORGANISM: Homo sapiens 
US-10-012-121A-282 



Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNWMLNHKR 


60 






1 1 1 1 1 1 1 1 II 1 1 1 I 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 




Db 


1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


c c\ 
bu 


Qy 


. 61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 




Db 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


ion 
LA U 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 




1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


1 on 


Qy 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 


Qy 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV1ILAANVKIVDWLPQSDLLAH 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVliLAANVKIVDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 


Qy 


421 


ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 





RESULT 13 

US-10-006-116A-282 



Sequence 282, Application US/10006116A 
Publication No. US20030082626A1 
GENERAL INFORMATION: 

APPLICANT: Baker, Kevin P. 

APPLICANT: Botstein, David 

APPLICANT: Desnoyers, Luc 

APPLICANT: Eaton, Dan 1. 

APPLICANT: Ferrara, Napoleone 

APPLICANT: Fong, Sherman 

APPLICANT: Gao, Wei-Qiang 

APPLICANT: Goddard, Audrey 

APPLICANT: Godowski, Paul J. 

APPLICANT: Grimaldi, Christopher J. 

APPLICANT: Gurney, Austin L. 

APPLICANT: Hillan, Kenneth J. 

APPLICANT: Pan, James 

APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C15 

CURRENT APPLICATION NUMBER: US/ 10/ 006, 116A 
CURRENT FILING DATE: 2001-12-16 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099602 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099642 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/0997 41 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099754 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/0997 63 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099792 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099808 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099812 



PRIOR FILING DATE : 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099815 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099816 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/100385 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100388 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100390 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100584 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 
PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 



PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/1024 84 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/1024 87 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/1034 01 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 



PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/104257 
PRIOR FILING DATE: 1998-10-14 
PRIOR APPLICATION NUMBER: 60/104987 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105000 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105002 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105104 
PRIOR FILING DATE: 1998-10-21 
PRIOR APPLICATION NUMBER: 60/105169 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105266 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105693 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105694 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105807 
.PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105881 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105882 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/106023 
PRIOR FILING DATE: 1998-10-28 

Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYIALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 18 0 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 24 0 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI7VKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTIIJ^VKIVDWLPQSDLLM 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV^LAANVKIVDWLPQSDLLAH 360 



Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I 1.1 I I I I I I I I I I I I I I I I I J.I I I I I I I I I I I I I I 

Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy 421 ALKMKQIMEDKRYKSAAVAASVT LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 4 80 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 4 80 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 14 

US-10-006-117A-282 

; Sequence 282, Application US/10006117A 
; Publication No. US20030082627A1 
; GENERAL INFORMATION: 



APPLICANT: 


Baker, Kevin P. 


APPLICANT: 


Botstein, David 


APPLICANT: 


Desnoyers, Luc 


APPLICANT: 


Eaton, Dan 1 . 


APPLICANT: 


Ferrara, Napoleone 


APPLICANT: 


Fong, Sherman 


APPLICANT: 


Gao, Wei-Qiang 


APPLICANT: 


Goddard, Audrey 


APPLICANT: 


Godowski, Paul J. 


APPLICANT: 


Grimaldi, Christopher J. 


APPLICANT: 


Gurney, Austin L. 


APPLICANT: 


Hillan, Kenneth J. 


APPLICANT: 


Pan, James 


APPLICANT: 


Paoni, Nicholas F. 



; TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
; TITLE OF INVENTION: Acids Encoding the Same 
; FILE REFERENCE: P2830P1C13 

; CURRENT APPLICATION NUMBER: US/10/ 006, 117A 
; CURRENT FILING DATE: 2002-03-19 

; Prior Application removed - See File Wrapper or Palm 
; PRIOR FILING DATE: 2001-07-09 
; NUMBER OF SEQ ID NOS : 477 
; SEQ ID NO 2 82 

LENGTH: 523 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-006-117A-282 



Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 52 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKI LTI STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I 



Db 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 12 0 



Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r 1 1 

Db 181 LS YVPVFRS LLTDHMD FWGRVKNFLMFFS FCRRQQHMQST FDNT I KEH FT EGS RPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAH 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

Qy .421. ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PT.QRLVGWI DHVLQTGGATHLKP YVFQQPW 48 0 

I I I I I I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 ALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPW 480 

Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 15 

US-10-017-527A-282 

Sequence 282, Application US/10017527A 
Publication No. US20030082628A1 
GENERAL INFORMATION: 
APPLICANT: Baker , Kevin P. 
APPLICANT: Botstein, David 
APPLICANT: Desnoyers, Luc 
APPLICANT: Eaton, Dan 1. 
APPLICANT: Ferrara, Napoleone 
APPLICANT: Fong, Sherman 
APPLICANT: Gao, Wei-Qiang 
APPLICANT: Goddard, Audrey 
APPLICANT: Godowski, Paul J. 
APPLICANT: Grimaldi, Christopher J. 
APPLICANT: Gurney, Austin L. 
APPLICANT: Hillan, Kenneth J. 
APPLICANT: Pan, James 
APPLICANT: Paoni, Nicholas F. 

TITLE OF INVENTION: Secreted and Transmembrane Polypeptides and Nucleic 
TITLE OF INVENTION: Acids Encoding the Same 
FILE REFERENCE: P2830P1C63 

CURRENT APPLICATION NUMBER: US/10/017 , 527A 
CURRENT FILING DATE: 2001-12-13 
PRIOR APPLICATION NUMBER: 60/098716 
PRIOR FILING DATE: 1998-09-01 



PRIOR APPLICATION NUMBER: 60/098723 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098749 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098750 
PRIOR FILING DATE: 1998-09-01 
PRIOR APPLICATION NUMBER: 60/098803 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098821 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/098843 
PRIOR FILING DATE: 1998-09-02 
PRIOR APPLICATION NUMBER: 60/099536 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099596 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099598 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099602 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099642 
PRIOR FILING DATE: 1998-09-09 
PRIOR APPLICATION NUMBER: 60/099741 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099754 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099763 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099792 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099808 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099812 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099815 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/099816 
PRIOR FILING DATE: 1998-09-10 
PRIOR APPLICATION NUMBER: 60/100385 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100388 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100390 
PRIOR FILING DATE: 1998-09-15 
PRIOR APPLICATION NUMBER: 60/100584 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100627 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100661 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100662 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100664 
PRIOR FILING DATE: 1998-09-16 
PRIOR APPLICATION NUMBER: 60/100683 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100684 



PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100710 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100711 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100848 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100849 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/100919 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/100930 
PRIOR FILING DATE: 1998-09-17 
PRIOR APPLICATION NUMBER: 60/101014 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101068 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101071 
PRIOR FILING DATE: 1998-09-18 
PRIOR APPLICATION NUMBER: 60/101279 
PRIOR FILING DATE: 1998-09-22 
PRIOR APPLICATION NUMBER: 60/101471 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101472 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101474 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101475 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101476 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101477 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101479 
PRIOR FILING DATE: 1998-09-23 
PRIOR APPLICATION NUMBER: 60/101738 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101741 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101743 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101915 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/101916 
PRIOR FILING DATE: 1998-09-24 
PRIOR APPLICATION NUMBER: 60/102207 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102240 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102307 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102330 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102331 
PRIOR FILING DATE: 1998-09-29 
PRIOR APPLICATION NUMBER: 60/102484 
PRIOR FILING DATE: 1998-09-30 



PRIOR APPLICATION NUMBER: 60/102487 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102570 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102571 
PRIOR FILING DATE: 1998-09-30 
PRIOR APPLICATION NUMBER: 60/102684 
PRIOR FILING DATE: 1998-10-01 
PRIOR APPLICATION NUMBER: 60/102687 
PRIOR FILING DATE: 1998-10-01. 
PRIOR APPLICATION NUMBER: 60/102965 
PRIOR FILING DATE: 1998-10-02 
PRIOR APPLICATION NUMBER: 60/103258 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103314 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103315 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103328 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103395 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103396 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103401 
PRIOR FILING DATE: 1998-10-07 
PRIOR APPLICATION NUMBER: 60/103449 
PRIOR FILING DATE: 1998-10-06 
PRIOR APPLICATION NUMBER: 60/103633 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103678 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103679 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER: 60/103711 
PRIOR FILING DATE: 1998-10-08 
PRIOR APPLICATION NUMBER:. 60/104257 
PRIOR FILING DATE: 1998-10-14 
PRIOR APPLICATION NUMBER: 60/104987 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105000 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105002 
PRIOR FILING DATE: 1998-10-20 
PRIOR APPLICATION NUMBER: 60/105104 
PRIOR FILING DATE: 1998-10-21 
PRIOR APPLICATION NUMBER: 60/105169 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105266 
PRIOR FILING DATE: 1998-10-22 
PRIOR APPLICATION NUMBER: 60/105693 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105694 
PRIOR FILING DATE: 1998-10-26 
PRIOR APPLICATION NUMBER: 60/105807 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/105881 



PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/1058 82 
PRIOR FILING DATE: 1998-10-27 
PRIOR APPLICATION NUMBER: 60/106023 
PRIOR FILING DATE: 1998-10-28 



Query Match 100.0%; Score 2768; DB 14; Length 523; 

Best Local Similarity 100.0%; Pred. No. 6.6e-272; 

Matches 523; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


1 


MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 


60 






1 1 1 1 II 1 M M 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I || | | | | | I || I I 




UD 


-1 

1 


MAGQRVLLLVGFLLPGVLLSEAAKI LTI STVGGSHYLLMDRVSQI LQDHGHNVTMLNHKR 


60 


Qy 


61 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 






i i i i i i t i i i i i i t i ■ i i i i i i i i i i i i i i ■ i ■ ■ i i ■ ■ i ■ i ■ ■ ■ i i > i i 

M 1 M II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | | | | | | | | | | | | | 




dd 


ol 


GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 


120 


Qy 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 






i i i i i i i i i i i i i i ■ i i i i i i i ■ i t i ■ ■ i i i ■ ■ ■ it ■ ■ i i , .I 

1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 


180 


Qy 


,181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II II 1 1 1 1 1 1 II 1 1 1 1 1 




Db 


181 


LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 


240 


Qy 


241 


LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 






i i i i i i i i i i i ■ i i i i i i i i i i i i i i i ■ i ■ ■ i i ■ i i ■ i iii i i ■ i i ■ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


LLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 


300 


Qy 


301 


GSMWTCQNPEIFKEMNNAFTUiLPQGVIWKCQCSHWPKDV^LAANVKIVDWLPQSDLLAH 


360 






1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 II 1 1 




Db 


301 


GSMVNTCQNPEI FKEMNNAFAHLPQGVI WKCQCSHWPKDVIiIA7\NWI VDWLPQSDLLAH 


360 


Qy 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 






1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | | | 




Db 


361 


PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 


420 


Qy 


421 


ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 






1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 




Db 


421 


T^KMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 


480 


Qy 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 





Search completed: May 7, 2004, 17:34:58 
Job time : 50 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: 



Title: 

Perfect score: 
Sequence : 



May 7, 2004, 17:28:39 ; Search time 45 Seconds 

(without alignments) 
3667.023 Million cell updates/sec 

US-10-017-867A-282 
2768 

1 MAGQRVLLLVGFLLPGVLLS . GKLLGMAVWWLRGARKVKET 523 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 1017041 seqs, 315518202 residues 

Total number of hits satisfying chosen parameters: 1017041 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

' Maximum Match 100% 

Listing first 45 summaries 



Database : SPTREMBL_25 : * 

1: sp_archea:* 

2: sp__bacteria : * 

3: sp_fungi:* 

4 : sp_human : * 

5: sp_invertebrate: * 

6 : sp_mammal : * 

7: sp_mhc:* 

8: sp_organelle : * 

9: sp_phage:* 
10: sp_plant:* 
11: sp_rodent:* 
12: sp_virus:* 
13: sp_vertebrate : * 
14: sp_unclassif ied: * 
15: sp_rvirus:* 
16: sp_bacteriap : * 
17: 5p_archeap:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


2754 


99 . 


c 
0 


coo 

b23 


4 


Q8NBP2 


Q8nbp2 homo sapien 


2 


2193 


/ y . 




coo 


4 


Q9 6DM6 


Q96dm6 homo sapion 


3 


T O 1 C 

lolo 


a c 
DO . 


r 
O 


coo 


11 


QoROYb 


Q8r0y5 mus musculu 


4 


1804 


c c 
DO . 




coo 


11 


Qo JZZ0 


Q8jzz0 mus musculu 


5 


1802 


65 . 


1 


523 


11 


Q8VC11 


Q8vcll mus musculu 


6 


963 


34 . 


8 


221 


4 


Q8NAW4 


Q8naw4 homo sapien 


7 


914 . 5 


33 . 


0 


252 


4 


Q8IYS9 


Q8iys9 homo sapien 


8 


792 




6 


O A O 


11 


QoBRY/ 


Q8bry7 mus musculu 


9 


727 


26. 


3 


541 


13 


Q98TB5 


Q98tbo gallus gall 


10 


698 


25 . 


2 


541 


11 


Q91W57 


Q91w57 mus musculu 


11 


689.5 


24 . 


9 


529 


6 


Q9TSL6 


Q9tsl6 macaca fasc 


12 


685 . 5 


24 . 


8 


529 


6 


097951 


097951 macaca fasc 


13 


685 . 5 


24 . 


8 


534 


11 


Q8BWQ1 


Q8bwql mus musculu 


14 


684 . 5 


24 . 


7 


498 


6 


Q9BDZ8 


Q9bdz8 bos taurus 


15 


681 


24 . 


6 


530 


11 


Q9R110 


Q9rll0 cavia porce 


16 


680 . 5 


24 . 


6 


534 


11 


Q9D811 


Q9d811 mus musculu 


17 


680 


24 . 


6 


529 


6 


Q9GLD9 


Q9gld9 macaca mula 


18 


677 . 5 


24 . 


5 


534 


11 


Q8R129 


Q8rl29 mus musculu 


19 


674 . 5 


24 . 


4 


530 


11 


Q7TT85 


Q7tt85 rattus norv 


20 


673 . 5 


24 . 


3 


533 


6 


046548 


046548 felis silve 


21 


672 . 5 


24 . 


3 


449 


4 


Q9H6S4 


Q9h6s4 homo sapien 


22 


670.5 


24 . 


2 


528 


11 


Q80X89 


Q80x89 mus musculu 


23 


670 


24 . 


2 


527 


11 


Q9ESE4 


. . ..Q9ese4 mus musculu 


24 


668 


24 . 


1 


529 


11 


Q8R084 


Q8r084 mus musculu 


25 


667 . 5 


24 . 


1 


529 


11 


Q8VIF8 


Q8vif8 cavia porce 


26 


667 


24 . 


1 


533 


6 


046549 


046549 felis sxlve 


27 


666. 5 


24 . 


1 


528 


6 


Q8WN97 


Q8wn97 macaca fasc 


28 


665 . 5 


24 . 


0 


533 


6 


046423 


046423 felis silve 


29 


665 


24 . 


0 


533 


6 


Q9TSL7 


Q9tsl7 macaca fasc 


30 


664 


24 . 


0 


531 


11 


Q8VD45 


Q8vd45 rattus norv 


31 


662 


23 . 


9 


533 


6 


Q95KM4 


Q95km4 macaca mula 


32 


661 


23. 


9 


528 


11 


Q8VIF9 


Q8vif9 cavia porce 


33 


660 . 5 


23 . 


9 


529 


6 


Q9GLE0 


Q9gle0 macaca mula 


34 


658 


23. 


8 


533 


6 


Q9BDG7 


Q9bdg7 macaca mula 


35 


650.5 


23. 


5 


532 


11 


Q8K154 


Q8kl54 mus musculu 


36 


643 


23 . 


2 


529 


11 


Q8BJL9 


Q8bjl9 mus musculu 


37 


637 . 5 


23 . 


0 


530 


11 


Q91WH2 


Q91wh2 mus musculu 


38 


637 . 5 


23 . 


0 


532 


4 


Q8WUQ4 


Q8wuq4 homo sapien 


39 


635 . 5 


23 . 


0 


529 


6 


018736 


018736 bos taurus 


40 


635 . 5 


23 . 


0 


530 


11 


Q8K169 


Q8kl69 mus musculu 


41 


634 


22. 


9 


530 


11 


Q8VCN3 


Q8vcn3 mus musculu 


42 


627 


22. 


7 


530 


6 


Q9TSM0 


Q9tsm0 macaca fasc 


43 


624.5 


22. 


6 


532 


6 


Q9TSL9 


Q9tsl9 macaca fasc 


44 


621.5 


22. 


5 


498 


11 


Q8VCQ9 


Q8vcq9 mus musculu 


45 


621 


22. 


4 


530 


6 


Q9TSL8 


Q9tsl8 macaca fasc 



ALIGNMENTS 



RESULT 1 
Q8NBP2 

ID Q8NBP2 PRELIMINARY; PRT; 523 AA. 

AC Q8NBP2; 

DT 01-OCT-2002 (TrEMBLrel . 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22 , Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 



DE Hypothetical protein NT2RP2002934 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Ota T., Nishikawa T., Suzuki Y., Kawai-Hio Y. , Hayashi K., Ishii S., 

RA Saito K., Yamamoto J., Wakamatsu A., Nagai T., Nakamura Y., 

RA Nagahari K. , Sugano S. r Isogai T. ; 

RT "HRI human cDNA sequencing project."; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK075383; BAC11583.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 523 AA; 59618 MW; 8 992 8A26AB9C09F5 CRC64; 

Query Match 99.5%; Score 2754; DB 4; Length 523; 

Best Local .Similarity 99.6%; Pred. No. 8.1e-222; 

Matches 521; Conservative 0; Mismatches 2; Indels 0; Gaps 0; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

I! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIP 180 

Qy 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 LSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

Qy 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II 
Db 241 LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFEDSGFVLVTL 300 

Qy 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVliLAANVKIVDWLPQSDLLAH 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTiLAiWVKIVDWLPQSDLLAH 360 

Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 



Qy 

Db 



421 
421 



480 
480 



Qy 481 HEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 HEQYLLDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 



RESULT 2 
Q96DM6 

ID Q96DM6 PRELIMINARY; PRT; 523 AA. 

AC Q96DM6; 

DT 01-DEC-2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein FLJ32504. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Small intestine; 

RA Kawakami B., Sugiyama A., Takemoto M. , Sugiyama T., Irie R. , 

RA Otsuki T., Sato H., Wakamatsu A., Ishii S., Yamamoto J., Isono Y., 

RA Kawai-Hio Y.., Saito K., Nishikawa T . , Kimura K., Yamashita H. , 

RA Matsuo K. , Nakamura Y., Sekine M. , Kikuchi H., Kanda K. , Wagatsuma M. , 

RA Murakawa K., Kanehori K., Takahashi-Fuj ii A., Oshima A., Suzuki Y., 

RA Sugano S., Nagahari K., Masuho Y., Nagai K., Isogai T.; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (OCT-2001) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK057066; BAB71358.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 523 AA; 59148 MW; 2034D6E90863EA8E CRC64; 

Query Match 79.2%; Score 2193; DB 4; Length 523; 

Best Local Similarity 78.8%; Pred. No. 7.5e-175; 

Matches 413; Conservative 45; Mismatches 64; Indels 2; Gaps 2; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

I I I I I I I I I III I I I I I I I I I I I I I I I : I II I I I I : I I I I I I I I : I I I I II I I I : 
Db 1 MVGQRVLLLVAFLLSGVLLSEAAKILTISTLGGSHYLLLDRVSQILQEHGHNVTML-HQS 59 

Qy 61 GPFM-PDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLAL 119 

I I : I I I : I I I I I I I I I : I I I I I : I I II : : I I I I : I I : : : I 

Db 60 GKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALDGRKESEALVKLMEIFGT 119 

Qy 12 0 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 17 9 

III: I : I I I I I I I I I I I I I : I II I I : I I I I I I I I I I I I I I I I : I I I I : I I I I 

Db 120 QCSYLLSRKDIMDSLKNENCDLVFVEAFDFCSFLIAEKLVKPFVAILPTTFGSLDFGLPS 17 9 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I II I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 18 0 PLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQSTFDNTIKEHFPEGSRPVLSH 239 



Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVWGGLMEKPIKPVPQDLENFIAKFGDSGFVXVT 299 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I : I I I I I I I : I I I I I 
Db 240 LLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPIKPVPQDLDNFIANFGDAGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLA 359 

I I I : I I I : I : I : I : I I I I I i I I I I I I II I I I I : I I I I I I I II I I I I I I I I I II 

Db 300 FGSMLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSSHWPRDVHLATNVKIVDWLPQSDLLA 359 

Qy 360 HPS I RLFVTHGGQN S IMEAI QHGVPMVGI PLFGDQP ENMVRVE7VKKFGVS IQLKKLKAET 419 

I I I I I I I I I I I I I I I : I I I : I I I I I I I : I : III I I I I I II : I I I I : I : : I : I 
Db 360 HPSIRLFWHGGQNSVMETIRHGVPMVGLPWGDQHGNMVRWAKNYGVSIRLNQVTADT 419 

Qy 420 LALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQP 47 9 

I I I I I :: I I I I II I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 420 LTLTMKQVIEDKRYKSAWAASVILHSQPLSPAQRLVGWIDHILQTGGATHLKPYVFQQP 479 

Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

I I I I I I I I I I I I I I I I I I I : I I I I I I I I : I I I I I I I I I : I 
Db 480 WHEQYLIDVFVFLLGLTLGTMWLCGKLLGWARWLRGARKVKKT 523 



RESULT 3 

Q8R0Y5 

ID Q8R0Y5 PRELIMINARY; PRT; 523 AA. 

AC Q8R0Y5; 

DT 01-JUN-2002 (TrEMBLrel. 21, Created) 

DT 01-JUN-2002 (TrEMBLrel. 21, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel . 25, Last annotation update) 

DE Hypothetical protein. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RA Strausberg R. ; 

RL Submitted (MAR-2002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC025940; AAH25940.1; -. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 523 AA; 59742 MW; F2EF34F4 1C3DEB95 CRC64; 

Query Match 65.6%; Score 1815; DB 11; Length 523; 

Best Local Similarity 65.4%; Pred. No. 3.3e-143; 

Matches 342; Conservative 73; Mismatches 106; Indels 2; Gaps 2; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I I I I I I I I I I I II I I I I II : I I I : I I : I I I I I I I I I . : I I : : 
Db 1 MAAHRSWLLVSFFLLEVLLLEAAKILTISTLSASHYILMNRVSQILQGGGHDVIKLLYEG 60 



Qy 



61 GPFMPDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 
I : I I I : I I I II I I : I I I I I : I : : : : I III: I I : : I I 



Db 



61 GD-IPDFRKENSSYQVINWRLPEDQQKTFENRWHRLIDEYAYGRSKYHTLLKIHQYFADL 119 



Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

I I I I : I I I I I : I : I I I I : I :::: I I I I I I I I I I I I : I I -Mil 
Db 120 CSHLLSRKDIMELLQKENFDLVLLDSMDLCSFLIVEKLGKRFVSFLPFQFSYMDFGLPNA 179 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I II: I III I I I I I I I I I I I II I : : : : : I : I I : : I I I I I I : I I I I 
Db 180 PLSYAPVYGSGLTDQMDFWGRVKNILMFFHFTKKRRDIFSQYGNTVQEHFAEGSQPVLSD 239 

Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

I I I I I I I I I : I I I I I I I I I I I I I I II I I I I :: I I :: I : I I I I I : I I :: I I I I II I I I 
Db 240 LLLKAELWFVNSDFALDFARPLFPNTVYVGGLLDKPVQPIPQDLEDFISQFGDSGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDW 359 

I I : I : I : I I I I I I : I I I I I I I I I : I I : I I I I I I I I I I I I I : II I I I I I I I 
Db 300 LDSWSMIQSKEIIKEMNSAFAHLPQGVLWTCKSSHWPKDVSLAPNVKIMDWLPQIDLLA 359 

Qy 360 H P S I RL FVTHGGQN S IMEAI QHGVPMVGI PL FGDQP ENMVRVEAKKFGVS I QLKKLKAET 419 

I I I II I I I I I I I II : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : MM: 

Db 360 HPSIRLFVTHGGMNSVMEAVHHGVPMVGIPFFGDQPENMVRVEAKNLGVSTQLQTLKAES 419 

Qy 420 IiALKMKQIMEDKRYKSAAVAASVI LRSHPLS PT.QRLVGWI DHVLQTGGATHLKP YVFQQP 47 9 

I II : : : I I : I I I : I I : I : I I I I II : I I I II I I I I I : I II I I I II I I I I I I I 

Db 420 FLLTMKEVIEDQRYKTAAMASKVIRNSHPLTPAQRLVGWIDHILQTGGAAHLKPYAFQQP 479 

Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

II I I I : I II : I I II I I II I II I I : I : I Mill: 

Db 4 80 WHEQYMLDVFLFLLGLTLGTLWLSVKVLVAVTRYLSISRKVKQ 522 



RESULT 4 
Q8JZZ0 

ID Q8JZZ0 PRELIMINARY; PRT; 523 AA. 

AC Q8JZZ0; 

DT 01-OCT-2002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein MGC37820. 

GN AI313915. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RA Strausberg R. ; 

RL Submitted (JUL-2 002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC034837; AAH34837.1; 

DR MGD; MGI: 2145969; AI313915. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 



SQ SEQUENCE 523 AA; 59662 MW; 4F7BD5ACBAFB5127 CRC64 ; 



Query Match 65.2%; Score 1804; DB 11; Length 523; 

Best Local Similarity 65.4%; Pred. No. 2.7e-142; 

Matches 342; Conservative 69; Mismatches 110; Indels 2; Gaps 



2; 



Qy 

Db 



1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I I I : I I I : I I I I I I I I I I I : I I I : : : I I I I : I : I I I I I I : ' 
1 MAAHRRWLLMS FLFLEVI LLEAAKI LTI STLSASHYI VI S RVSQ VLHEGGHNVTKLLYES 60 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I II : I I : I I I I I : I I I I : : : I II III I I : : I 

61 AN-IPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 

121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

II I : I I I I I I I I I I I I I : I : : : : I I I I II I I I I I : I I : : I I I I 
120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 

180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I II: I III I I I I I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 

180 PLSYAPVYGSGLTDQMDFWGRVKNFLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 239 

240 -LLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPV 299 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I :: I I :: I : I I I I I I I I :: I I I I I I I I I 

240 LLLKAELWFVNSDFALDFARPLFPNTVYVGGLLDKPVQPIPQDLENFISQFGDSGFVLVA 299 

300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV^ 359 

I I I : I : I : II I I I I : I I I I I I I I I : I I : I I I I I II I I : I I I I : I I I I I : I I I I 

300 LGSIVSMIQSKEIIKEMNSAFAHLPQGVLWTCKTSHWPKDVSLASNVKIMDWLPQTDLLA 359 

360 HPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAET 419 

I I I I I I I I I I I I I I : I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I : MM: 

360 HPSIRLFVTHGGMNSWIEAvllHGVPMVGIPFFFDQPENMVRVEAKNLGVSIQLQTLKAES 419 



Qy 



Db 



420 LALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQP 479 

I | | | : | : | I I I I I I I I : I : : I I I I I : I I I I : I I I I I : I I I I I I I I I I I I I I I 
420 FALTMKKI I EDKRYKSAAMASKI I RHSHPLTPAQRLLGWI DHI LQTGGAAHLKPYAFQQP 47 9 



Qy 



Db 



480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

I I I I I : I II : I I I I I I I I I I I I : I : I I I I I I 

4 80 WHEQYMLDVFLFLLGLMLGTLWLSVKVLVAVTRYLSIATKVKE 522 



RESULT 5 
Q8VC11 
ID 



Q8VC11 PRELIMINARY; PRT; 523 AA. 

AC Q8VC11; 

DT 01-MAR-2002 (TrEMBLrel . 20, Created) 

DT 01-MAR-2002 (TrEMBLrel. 20, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein (Hypothetical UDP-glucoronosyl and UDP-glucosyl 

DE transferase containing protein). 

GN AI313915. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI TaxID=10090; 



RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RA Strausberg R. ; 

RL Submitted (JAN-2002) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=22354683; PubMed=12466851; 

RA The FAN TOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; BC022134; AAH22134.1; 

DR EMBL; AK050128; BAC34080.1; -. 

DR MGD; MGI:2145969; AI313915. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF002 01; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 523 AA; 59672 MW; BC7BD6ADF197ADD9 CRC64; 



Query Match 65.1%; Score 1802; DB 11; Length 523; 

Best Local Similarity 65.4%; Pred. No. 4e-142; 

Matches 342; Conservative 68; Mismatches 111; Indels 2; Gaps 2; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I I I : II I : I I I I I I I I I I I : I I I : : : I I I I : I : I I I I I I : : 

Db 1 MAAHRRWLLMSFLFLEVILLEAAKILTISTLSASHYIVISRVSQVLHEGGHNVTKLLYES 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I I I : I I : I I I I I : I I I I : : : I II III I I : : I 

Db 61 AN-IPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 17 9 

II I : I I I I I I I I I I I I I : I : : : : I I I I I I I I I I I : I I : : I I I I 
Db 120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 

Qy 180 PL S YVP VFRS L LT DHMD FWGRVKN FLMFFS FCRRQQHMQS T FDNT I KEH FT EGS RPVL S H 239 

I I I I I I : I I I I I I I I I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 
Db 180 PLS YAP VYGSGLTDQMD FWGRVKN FLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 239 

Qy 24 0 LLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I :: I I :: I : I I I II I I I :: I I I I I I I I I 

Db 24 0 LLLKAELWFVNSDFT^LDFARPLFPNTVYVGGLLDKPVQPIPQDLENFISQFGDSGFVLVA 299 

Qy 300 LGSMWTCQNPEIFKEMNNAF7\HLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLA 359 

I I I : I : I : I I I I I I : I I I M I I I I : I I : I I I I I I I I I I I I I : I I I I I : I I I I 

Db 300 LGSIVSMIQSKEIIKEMNSAFAHLPQGVLWTCKTSHWPKDVSLAPNVKIMDWLPQTDLLA 359 

Qy 360 HPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAET 419 

I I I I I I I I I I I I I I : I I | : | I I I I I I I I I I I I I I I I I I I I I lllllhllll: 
Db 360 HPSIRLFVTHGGMNSVMEAVHHGVPMVGIPFFFDQPENMVRVEAKNLGVSIQLQTLKAES 419 



Qy 420 IALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQP 479 

II I I : I : I I I I I I I I I : I : : I I I I I : I I I i : I I I I I : II I I I I I I I I I I I I I 
Db 420 FALTMKKI I EDKRYKSAAMASKI I RHSHPLTPAQRLLGWI DHI LQTGGAAHLKPYAFQQP 479 

Qy 480 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

I I I I I : I II : I I I I I I I I I I I I : I : I I I I I I 

Db 480 WHEQYMLDVFLFLLGLMLGTLWLSVKVLVAVTRYLSIATKVKE 522 

RESULT 6 
Q8NAW4 



ID Q8NAW4 PRELIMINARY; PRT; 221 AA. 

AC Q8NAW4; 

DT 01-OCT-2 002 (TrEMBLrel. 22, Created) 

DT 01-OCT-2002 (TrEMBLrel. 22, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE Hypothetical protein FLJ34658. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Kidney; 

RA Tashiro H., Yamazaki M. , Watanabe K. , Kumagai A., Itakura S., 

RA Fukuzumi Y. , Fujimori Y. , Komiyama M., Sugiyama T., Irie R., 

RA Otsuki T., Sato H. , Wakamatsu A., Ishii S., Yamamoto J., Isono Y. , 

RA Kawai-Hio Y., Saito K. , Nishikawa T., Kimura K. , Yamashita H., 

RA Matsuo K., Nakamura Y. , Sekine M. , Kikuchi H., Kanda K. , Wagatsuma M. , 

RA Murakawa K., Kanehori K., Takahashi-Fuj ii A., Oshima A., Sugiyama A. , 

RA Kawakami B . , Suzuki Y., Sugano S., Nagahari K., Masuho Y-, Nagai K., 

RA Isogai T. ; 

RT "NEDO human cDNA sequencing project."; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/DDBJ databases. 

DR EMBL; AK091977; BAC03783.1; 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 221 AA; 24764 MW; ACB59BEF47ACD4 58 CRC64; 



Query Match 34.8%; Score 963; DB 4; Length 221; 

Best Local Similarity 79.6%; Pred. No. 2.1e-72; 

Matches 17 6; Conservative 21; Mismatches 24; Indels 0; Gaps 0; 



Qy 303 MWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVIII^^VKIVDWLPQSDLLAHPS 362 

1:11 I : I : I : I : I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MLNTHQSQEVLKKMHNAFAHLPQGVIWTCQSSHWPRDVHLATNVKIVDWLPQSDLLAHPS 60 

Qy 363 IRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLAL 422 

I I I I I I I I I I I I : I I I I : I I I I I I I : I : I I I I I I I I I I : I I I I : I : : I : I I I 

Db 61 I RL FVTHGGQN S VMEAI RH GVPMVGL P VNGDQHGNMVRWAKN YGVS I RLNQ VT ADT LT L 12 0 



Qy 



423 KMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHE 4 82 
II I : : I I I I I I I I I II I II I I MM I M I M II I : M M I I II I II I I I I II I I 



Db 121 TMKQVIEDKRYKSAVYAASVILHSQPLSPAQRLVGWIDHILQTGGATHLKPYAFQQPWHE 180 

Qy 4 83 -QYLFDVEVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKET 523 

III I I I I I I I I I I I I I : I I I I I I II : I I I I I I I I I : I 

Db 181 QYLIDVFVFLLGLTLGTMWLCGKLLGWARWLRGARKVKKT 221 



RESULT 7 
Q8IYS9 

ID Q8IYS9 PRELIMINARY; PRT ; 252 AA. 

AC Q8IYS9; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical protein. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Testis; 

RA Strausberg R. ; 

RL Submitted (JUL-2002) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC035012; AAH35012.1; 

DR GO; GO:0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P rmetabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 252 AA; 29155 MW; 956CC9F9718B8982 CRC64; 

Query Match 33.0%; Score 914.5; DB 4; Length 252; 

Best Local Similarity 75.7%; Pred. No. 2.9e-68; 

Matches 171; Conservative 21; Mismatches 33; Indels 1; Gaps 1; 

58 HKRGPFM-PDFKKEEKSYQVI SWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEY 116 

I : I I : I I I : I I I I I I I I I : I I I I I : I I I I : : I i I I : I I : :: I 

3 HQSGKFLIPDIKEEEKSYQVIRWFSPEDHQKRIKKHFDSYIETALDGRKESEALVKLMEI 62 

117 LALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFG 17 6 

III: I : II I I I I I I I I I I : I : I II I I : I I I I I I I I I I I I I I I I : I I I I : I I 

63 FGTQCSYLLSRKDIMDSLKNENYDLVFVEAFDFCSFLIAEKLVKPFVAILPTTFGSLDFG 122 

177 LPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPV 236 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M 

123 LPSPLSYVPVFPSLLTDHMDFWGRVKNFLMFFSFSRSQWDMQSTFDNTIKEHFPEGSRPV 182 

237 LSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I I I I I I I I I II I : II I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : 

183 LSHLLLKAELWFVNSDFAFDFARPLLPNTVYIGGLMEKPIKPVPQN 228 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 



RESULT 8 
Q8BRY7 

ID Q8BRY7 PRELIMINARY; PRT; 302 AA. 

AC Q8BRY7; 



DT 01-MAR-2003 (TrEMBLrel . 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Hypothetical UDP-glucoronosyl and UDP-glucosyl transferase containing 

DE protein. 

GN AI313915. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI__TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Aorta , and Vein; 

RX MEDLINE=22354 68 3; PubMed=12466851 ; 

RA The FAN TOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,77 0 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK041045; BAC30796.1; -. 

DR MGD; MGI:2145969; AI313915. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0.00.8152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

KW Hypothetical protein. 

SQ SEQUENCE 302 AA; 35256 MW; C78A84C1D58987DC CRC64; 



Query Match 28.6%; Score 792; DB 11; Length 302; 

Best Local Similarity 57.9%; Pred. No. 6.7e-58; 

Matches 157; Conservative 37; Mismatches 75; Indels 2; Gaps 2; 

Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

II I 11:11 I : I I I I I I I I I I I : I I I : : : I I I I : I : I I I I I I : : 
Db 1 MAAHRRWLLMSFLFLEVILLEAAKILTISTLSASHYIVISRVSQVLHEGGHNVTKLLYES 60 

Qy 61 GPFMPDFKKEEKS YQVI SWLAPEDHQREFKKS FDFFLEETLGGRGKFENLLNVLEYLALQ 120 

: I I I : I I : I I I I I : I I I I : : : I II III I I : : I 

Db 61 AN-IPDFRKEKPSYQVINWRPPEDQEKKFADLRHRLTEEITYGRSKHHTLLKIHQYFGDL 119 

Qy 121 CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLP-I 179 

II I : I I I I I I I I I I I I I : I : : : : I I I I I I I I I I I : I I -MM 

Db 120 CSQLLSRKDIMDFLKNENFDLVLLDSMDLCSLLIVEKLGKRFVSFLPFQFSYMDFGLPSA 179 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSH 239 

I I I I I I : I I I I I I I I I I I I I I I I I I : I : : I : I : I I : I I I I I I : I I I I 

Db 180 PLSYAPVYGSGLTDQMDFWGRVKNFLMFLDFSMKQREILSQYDSTIQEHFVEGSQPVLSD 239 

Qy 24 0 LLLKAELWFINSDFAFDFARPLLPNTVYVGG 27 0 

I I I I I I I I I : I I I I I MINI I I I I I I I I 
Db 240 LLLKAELWFVNSDFALDFARPLFPNTVYVGG 270 



RESULT 9 
Q98TB5 

ID Q98TB5 PRELIMINARY; PRT; 541 AA. 

AC Q98TB5; 



DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE UDP-galactose ceramide galactosyltransf erase (EC 2.4.1.47). 

GN CGT . 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus. 

OX NCBI_TaxID=9031 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RA Kapitonov D. ; 

RT "Molecular cloning and expression of ceramide galactosyltransf erases . 

RT Comparison with other glycosyltransf erases . " ; 

RL Thesis (1997), Medical College of Virginia, Richmond, VA, USA. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Kapitonov D. ; 

RL Submitted (FEB-1999) to the EMBL/GenBank/ DDBJ databases. 

DR EMBL; AF129809; AAK16234.1; 

DR GO; GO: 0047263; F: N-acylsphingosine galactosyltransf erase act. . .; IEA. 

DR GO; GO: 0016758; F: transferase. ..activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase . 

SQ SEQUENCE 541 AA; 61598 MW; DFF1CA4C69E781CC CRC64; 

Query Match 26.3%; Score 727; DB 13; Length 541; 

Best Local Similarity 33.3%; Pred. No. 4.1e-52; 

Matches 166; Conservative 99; Mismatches 199; Indels 34; Gaps 10; 

Qy 13 LLPGVLLSELAAKILTISTV-GGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEE 71 

I | : : | M I : : : I I : : : I I I I I : I 

Db 11 LWSAVGIARAAKIVWPPIMFESHLYIFKTLASALHDQGHQTVFLLSEGREIPPSNHYRL 70 

Qy 72 KSYQVISWLAPEDHQREFKKS-FDFFLEETL GGRGKFENLLNVLEYLALQCSHFLN 126 

III I I I I I : : II | :: | :: : | : 

Db 71 KRYPGI FNSSTSDDFLQSKMRSIFSGRLTALELFDILDHYSKNCDMIVG 119 

Qy 127 RKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPIPLSYVPV 18 6 

::: I : I I I II : :: I : : I I : I I II : : : I I I I I I I I I 

Db 120 NQNLMHALKQEKFDLLLVDPNEMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPAPLSYVPE 17 9 

Qy 187 FRS LLT DHMD FWGRVKN FLMF FSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHL 240 

| | | | | | | : : | : | | : : | | : : : : : : | | : I 

Db 180 FNSLLTDRMNLFERMKNTFVYVISRFGVSFL VLPKYERIMQKHKVLPERSMYD-L 233 

Qy 241 LLK7VELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

: : I I : : I I : I I I I I I I I I I I : : I I I : I : I I : : : : : I I I I I : 

Db 234 VI1GSSLWMLCTDIALEFPRPTLPNVVYVGGILTKPASPLPEDLQTWVNGANENGFVLVSF 293 



Qy 

Db 



301 GSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVIILAANVKIVDWLPQSDLL7U1 360 

I : I : : : : : I I I I I I I I : : I ■ : I I I : : : I I I I : I I I I 

294 GAGVKYL-SEDVANKLARALARLPQRVIWRFSGN KPRNLGNNTKLIEWLPQNDLLGH 34 9 



Qy 361 PSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETL 420 

I : I : I :: I I I I I I I : I I I I : I I I I I I I I : I I I : I I I : ■ : I : I 
Db 350 PNIKAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQAKGMGILLNWKTVTESEL 409 

Qy 421 ALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPW 480 

: : : : : I I : I I I : I I I I I I : : : I : I I I I : I : 
Db 410 YEALEKVINDPSYRQRAQRLSEIHKDQPGHPVNRTVYWINYILRHNGAQHLRAAVYSISL 469 

Qy 481 HEQYLFDV-FVFLLGLTL 4 97 

: : : I I : I I I : I I 
Db 470 YQYFLLDIAFWLVGAAL 487 



RESULT 10 
Q91W57 

ID Q91W57 PRELIMINARY; PRT; 541 AA. 

AC Q91W57; 

DT 01-DEO2001 (TrEMBLrel. 19, Created) 

DT 01-DEC-2001 (TrEMBLrel. 19, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE UDP-glucuronosyltransf erase 8. 

GN UGT8. 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Kidney; 

RA Strausberg R. ; 

RL Submitted (NOV-2 001) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; BC016885; AAH16885.1; -. 

DR MGD; MGI: 109522; Ugt8 . 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0016740; F: transferase activity; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase. 

SQ SEQUENCE 541 AA; 61249 MW; 0E3436597F8810D0 CRC64; 

Query Match 25.2%; Score 698; DB 11; Length 541; 

Best Local Similarity 32.4%; Pred. No. l.le-49; 

Matches 167; Conservative 95; Mismatches 184; Indels 70; Gaps 12; 

Qy 13 LLPGVLLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I | : : | | | | : : : I I : : : I : I I : I : I : I 
Db 11 LWSAVGIARAAKI I IVPPIMFESHLYI FKTLASALHERGHHTVLLLSEGRD 61 

Qy 71 EKSYQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : I III:: II | : : : | : : 

Db 62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAVELVDILDHYTK 112 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : : . I I I I I : : : I : I I I : I I I I : : : III 



Db 



113 NCDMMVGNQALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 



Qy 180 PLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQH MQSTFD 222 

I I : I I I I I I I I I I : I I : I I I I : : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNFLERMKNTGVYLISRIGVSFLVLPKYERIMQKYNLLPAKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
Db 233 LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

Qy 283 LENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEIWNAFAHLPQGVIWKCQCSHWPKDVHL 342 

I : : : : : I I I I I : I : I : : I : : I III III: : I : I 

Db 276 LQRWVSGAQEHGFVLVSFGAGVKYL-SEDI7\NKLAGALGRLPQKVIWRFSGT KPKNL 331 

Qy 343 AANVKIVDWLPQSDLLAHPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

I I : : : II I I : I I I I : I I I : : I I I III I : I I I I : I I I I I I I I : I I I : 
Db 332 GNNTKLI EWLPQNDLLGHSNI RAFLSHGGLNS I FETMYHGVPWGI PLFGDHYDTMTRVQ 391 

Qy 4 03 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 4 62 

II I : : : : I : : : : : I : | I I : | | | II I : : 

Db 392 AKGMGILLEWNTWEGELYDALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTTYWIDYI 451 

Qy 463 LQTGGATHLKP YVFQQPWHEQYLFDV- FVFLLGLTL 497 

I : MM: || : : M I : M I I I I 
Db 452 LRHDGARHLRSAVHQISFCQYFLLDIAFVLLLGAVL 487 



RESULT 11 
Q9TSL6 

ID Q9TSL6 PRELIMINARY; PRT; 529 AA. 

AC Q9TSL6; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-JUN-2003 (TrEMBLrel. 24, Last annotation update) 

DE UDP-glucuronosyltransferase 2B23 (EC 2.4.1.17). 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=954 1 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=20043918; PubMed=10579317 ; 

RA Barbier O., Levesque E., Belanger A., Hum D.W.; 

RT "UGT2B23, a novel uridine diphosphate-glucuronosyltransf erase enzyme 

RT expressed in steroid target tissues that conjugates androgen and 

RT estrogen metabolites . " ; 

RL Endocrinology 140:5538-5548(1999). 

DR EMBL; AF112113; AAF14353.1; 

DR GO; GO: 0015020; F: glucuronosyltransf erase activity; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransferase; Transferase. 

SQ SEQUENCE 529 AA; 60957 MW; 646315E6D970541A CRC64; 



Query Match 24,9%; Score 689.5; DB 6; Length 529; 

Best Local Similarity 32.1%; Pred. No. 5.5e-49; 

Matches 168; Conservative 85; Mismatches 205; Indels 65; 



Gaps 14; 



Qy 34 S H YL LMDRVS Q I LQDHGHN VTML NH KRGPFMPDFKKEE KSYQ 75 

I I : : I : : I I I I I I I : I I II : : 

Db 34 S HWMNMKT I L E E L VQ RGH E VT ALAS S AS ILFDPNNS SAL K I E VF P T S L P K P E FEN I VT Q E 93 

Qy 76 VI SWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNRKDIMDSL 134 

: I : I : I : I : : I I : : I : : I : I I 

Db 94 IKRWIELPKD TFWLYFSQMQEIMWKFGDI FRNF CKDWSNKKLMKKL 140 

Qy 135 KNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEF GLPIPLSY 183 

: I I : I : I I : I I II III I III 

Db 141 QESRFDWFADPIFPCSELLAELFNIPLVY SLRFTPGYVFEKHCGGFLFPPSY 193 

Qy 184 VPVFRSLLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTE — GSRPVLSH 239 

III I I : I I I I I I I : : : : I II : I : : : I I I II 

Db 194 VP WMS E LS DQMT FMERVKNMI YML YFD FC FQ I YDMKKW DQFYTEVLGRHTTLSE 248 

Qy 240 LLLKAELWFINSDFAFDFARPLLPNTVTVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVT 299 

: : I I : : I I : : I I I I II I : : I I I : I I I I : I : : : I I : I : : I I : I 
Db 249 IMGKADIWLIRNSWNFQFPHPLLPNVDFIGGLLCKPAKPLPKEMEEFVQSSGENGWVFT 308 

Qy 300 LGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDV1ILAANVKIVDWLPQSDLLA 359 

lllhl I : : I I : I I I : I : : I I I : : I : I I : I I I 
Db 309 LGSMI -TNMKEERANVI ASALAQI PQKVLWRFDGN KPDTLGVNTRLYKWI PQNDLLG 364 

Qy 360 HPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAET 419 

II : I : I I I I I I I I I I I I I I I I I I I I I I I : I : : : : I : : : 

Db 365 HPKTKAFITHGGTUSIGIYEAIYHGVPMVGIPLFADQPDNITVHMKTRGAAVQLDFDTMSSTD 424 

Qy 420 LALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQP 479 

I : I : : I II : III hi I I I I : I : : lllhl 
Db 425 LWALKTVINDPLYKENWKLSRIQRDQPVKPLDRAVFWIEFVMRHKGAKHLRPAAHDLT 484 

Qy 4 80 WHEQYLFDVFVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKE 522 

I : : I I I III : : : I I : I I I I : 

Db 485 WFQYHSFDVIGFLLACVATVIFIIMKCCLFCFW — KFARKGKK 525 

RESULT 12 
097951 



ID 097951 PRELIMINARY; PRT; 529 AA. 

AC 097951; 

DT 01-MAY-1999 (TrEMBLrel . 10, Created) 

DT 01-MAY-1999 (TrEMBLrel. 10, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE UDP-glucuronosyltransf erase . 

GN UGT2B18. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 



RA Beaulieu M. , Levesque E., Barbier O., Turgeon D., Belanger G., 

RA Hum D.W., Belanger A.; 

RL Submitted (JUL-1997) to the EMBL/ GenBank/ DDB J databases. 

DR EMBL; AF016310; AAC98726.1; 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . . ; IEA. 

DR GO; GO: 0016740; F: transferase activity; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP__gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase. 

SQ SEQUENCE 529 AA; 60801 MW; 3ECEB4 97B8C3601F CRC64; 



Query Match 24.8%; Score 685.5; DB 6; Length 529; 

Best Local Similarity 33.2%; Pred. No. 1.2e-48; 

Matches 173; Conservative 75; Mismatches 212; Indels 61; Gaps 14; 

Qy 34 SH YLLMDRVSQI LQDHGHNVTML NH KRGPFMPDFKKEEKS YQ 75 

I I : : I : : I M I I : I I : I I II I 

Db 34 S HWMNMKT I L E E L VQ RGH E VT VLAS S AS I L FD PNN S S AL K I E VF P T S LT KT E FEN 1 1 RQQ 93 

Qy 76 VISWL-APEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNRKDIMDSL 134 

: I 1:1 : I : : I I : : I :: I : I I 

Db 94 IKRWSELPKD TFWLYFSQMQEIMWKFGDITRNF CKDWSNKKLMKKL 140 

Qy 135 KNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEF GLPIPLSY 183 

I I : I : Ihlllll III I III 

Db 141 QKSRFDWFADAIFPCSELLAELLNTPLVY SLRFTPGYNFEKHCGGFLFPPSY 193 

Qy 184 VPVFRSLLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLL 241 

III I I : I I I I I I I I :::: I I I I : : I I I II : 

Db 194 VPVVMSELSDHMTFMERVKNMIYMLYFDFC-FQI YAMKKWDQFYSE — VLGRPTTLSETM 250 

Qy 242 LKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLG 301 

I I : : I I : : I I I I I I I : I I I I I I I I : I : : : I I : I :: I I : : I I 
Db 251 GKADIWLIRNSWNFQFPHPLLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLG 310 

Qy 302 SMvTCTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVT^ 361 

I I I I I : : | | : | | | : | : | | | :: I : I ) : I I I I I 

Db 311 SMV-TNMKEERANVI ASALAQI PQKVLWRFD GKKPDTLGLNTRLYKWIPQNDLLGHP 366 

Qy 362 SIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLA 421 

I I : I I I I I I I I I I I I I I II I I I I I I I : I : : : I I I : : : I 
Db 367 KTRAFITHGGSNGIYEAIYHGVPMVGIPLFADQPDNIAHMKAKGAAVRLDFDTMSSTDLV 426 

Qy 422 LKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWH 481 

: I : : I II : II hi I I I I : I : : I I I I : I I 
Db 427 NALKTVINDPLYKENVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKG7VKHLRPAAHDLTWF 486 

Qy 482 EQYLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

: : I I I I I : : : I I : I I I I : 

Db 487 QYHSLDVIGFLLACVATVIFIIMKCCLFCFW — KFARKGKK 525 



RESULT 13 
Q8BWQ1 

ID Q8BWQ1 PRELIMINARY; PRT; 534 AA. 



AC Q8BWQ1; 

DT 01-MAR-2003 (TrEMBLrel. 23, Created) 

DT 01-MAR-2003 (TrEMBLrel. 23, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE Similar to UDP glucuronosyltransf erase UGT2A3 . 

GN 2010321J07RIK. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=C57BL/6J; TISSUE=Liver ; 

RX MEDLINE=22354683; PubMed=12466851; 

RA The FANTOM Consortium, 

RA the RIKEN Genome Exploration Research Group Phase I & II Team; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

DR EMBL; AK050327; BAC34 191.1; 

DR MGD; MGI: 1919344; 2010321 J07Rik . 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. , 

DR PROSITE; PS00375; UDPGT; 1. 

SQ SEQUENCE 534 AA; 61119 MW; 3BF4F591395B1620 CRC64 ; 

Query Match 24.8%; Score 685.5; DB 11; Length 534; 

Best Local Similarity 33.5%; Pred. No. 1.2e-48; 

Matches 171; Conservative 83; Mismatches 195; Indels 61; Gaps 13; 

Qy 34 SHYLLMDRVSQI LQDHGHN VTMLNH KRGPFMPDFKKEEKSYQVIS 78 

I I : I : : : I II 11*1 • : I : : : I I : 

Db 34 SHWLNLKTI LEELGARGHEVTVLKYPS I I I DQSKRI PLHFENI PLLYEI ETAENRLNEI A 93 

Qy 79 WLA PEDHQREFKKSF-DFFLEETLGGRGKFENLLNVLEYLALQCSHFLNRKDIMDS 133 

II I I I : I I I I : I III:: I I : I I 

Db 94 NLAVNVI PNLSLWEAAKTLQDFFLQVT GDFESI CRSVLYNQKFMDK 139 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG SLEFG-LPIPLSYVPVFRS 18 9 

I:: :|:|::: I hill III I II I Mill MM I 

Db 140 LRDAQYDVWIDPWPCGELVAEVXQIPFWTLRFSMGYYMEKHCGQLPIPLSWPVWIS 199 

Qy 190 LLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNTIKEHF TEGSRPVLSHLLLKA 244 

I I I : I I I I I I I : I : : I : I : I II : I I 

Db 200 ELTDNMTFTERVKNMMFSLLFEYWLQQ YDFAFWDQFYSETLGRPTTFCKTVGKA 253 

Qy 245 ELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV 304 

: : I I : : : I I I I I I MM! I I I I : I : : : I \ : I : I I : : I I I I I 
Db 254 DIWLIRTYWDVEFPRPYLPNFEFVGGLHCKPAKPLPKEMEEFVQSSGEHGWVFSLGSMV 313 

Qy 305 NTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHIJU^NVKIVDWLPQSDLLAHPSIR 364 

I : : I : I I I : I : I I : I : : : I : I I : I I I I I : 

Db 314 KNL-TEEKANLIASVLAQIPQKVLWRYS GKKPATLGSNTRLFNWI PQNDLLGHPKTK 369 



Qy 



365 LFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKM 424 



I • I I I I I I III I I I I i I I • I • I I I I I . 'Ill I 

Db 370 AFITHGGTNGIYEAIYHGVPMVGVPMLGDQPHNIAHMEAKGAALKVSISTMTSTDLLSAV 429 

Qy 425 KQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPWHEQY 484 

: : : : II I : II I : I I I I I : I : : I I I I : I ' ' 

Db 430 RAVINEPSYKENAMRLSRIHHDQPWPLDRAVFWIEFVMRHKGAKHLRVAAHDLSWFQYH 489 

Qy 48 5 LFDVFVFLL GLTLGTLWLCGKL 506 

I I I I I :| I : : I II 

Db 490 SLDVIGFLLLCWTLTFIITKFCLFVCQKL 519 

RESULT 14 
Q9BDZ8 



ID Q9BDZ8 PRELIMINARY; PRT; 498 AA. 

AC Q9BDZ8; 

DT 01-JUN-2001 (TrEMBLrel. 17, Created) 

DT 01-JUN-2001 (TrEMBLrel. 17, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE UDP-galactose ceramide galactosyltrans f erase (EC 2.4.1.47) 

DE (Fragment) . 

GN CGT . 

OS Bos taurus (Bovine) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Cetartiodactyla; Ruminantia; Pecora; Bovoidea; 

OC Bovidae; Bovinae; Bos. 

OX NCBI_TaxID=9913; 

RN [1] 

RP SEQUENCE FROM N - A. 

RA Kapitonov D. ; 

RT "Molecular cloning and expression of ceramide galactosyltransf erases . 

RT Comparison with other glycosyltrans f erases . " ; 

RL Thesis (1997), Medical College of Virginia, Richmond, VA, USA. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Kapitonov D. ; 

RL Submitted (FEB-1999) to the EMBL/ GenBank/DDB J databases. 

DR EMBL; AF129810; AAK16235.1; 

DR GO; GO: 0047263; F: N-acylsphingosine galactosyltransf erase act. . .; IEA. 

DR GO; GO: 0016758; F: transferase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0008152; P :metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Glycosyltransf erase; Transferase. 

FT NON_TER 1 1 

SQ SEQUENCE 498 AA; 56674 MW; 83645A7079ACF582 CRC64; 



Query Match 24.7%; Score 684.5; DB 6; Length 498; 

Best Local Similarity 32.8%; Pred. No. 1.3e-48; 

Matches 161; Conservative 92; Mismatches 173; Indels 65; Gaps 12; 

Qy 58 HKRGPFMPDFKKEEKS YQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGK 106 

I : I I : I : : I I : I : : I III:: II 

Db 3 HERGHHTVFLRS EGRD IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLT 56 



Qy 107 FENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAIL 166 



I I : : I : : I : : : I I I I I I : : : I : I I I I I I II : 
Db 57 AIELLDILDHYTKSCDMMVGNHALIQGLKQENFDLLLVDPNDMCGFLIAHLLGVKYAVFS 116 

Qy 167 STS FGSLEFGLPI PLS YVPVFRSLLTDHMDFWGRVKN FLMFFSFCRRQ 214 

: : I I I I I : I I I I I I I I I I I : I : I I I I : -I 

Db 117 T G L W Y P AE VGAP AP LA YVP E FN S L LT DHMN L LQ RMKNT GVYL I S R I G I S FL VL P K Y E RI M 176 

Qy 215 QHM QSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVG 269 

I : I : I I :: I I :: I I : I I I I I I I I I I 

Db 177 QEYNLLPEKSMYD LVYGSSLWMLCTDVALEFPRPTLPNWYVG 219 

Qy 270 GLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIW 329 

I : : I I I : I : I : : : : I I I I I : I : I : : I : : I I I I I I I 

Db 220 G I LT K PAS P L P ED FQ RWVN GANEHG FVLVS FGAGVK YL ^- S E D I AT KLAGAL GRL P QKVI W 278 

Qy 330 KCQCSHWPKDVIiLAANvTaVDWLPQSDLLAHPSIRLFW 38 9 

: : I : I I : : : : I I I I : ! I I I : I : I : : I I I I I I I : I I I I : I I I I 
Db 279 RFSGT KPKNLGNNTRLI EWLPQNDLLGHSNI KAFLSHGGLNS I FETMYHGVPWGI P 335 

Qy 390 LFGDQPENMVRVEAKKFGVSIQLKKLKAETL^KMKQIMEDKRYKSAAVAASVILRSHPL 449 

I I I I : | : | | : | I I : : : I : I : : : : : I : I I I : 

Db 336 LFGDHYDTMIRVQAKGMGILLEWKTVTEGELYEALVKVINNPSYRQRAQKLSEIHKDQLR 395 

Qy 450 SPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFD-VFVFLLG LTLGTLW 501 

I I I I I I : : I : III:: II : : : I I I I I I I I I : I : 

Db 396 HPWRTVYWIDYILRHDGAHHIRAAVHQISFCQYFLLDIVFVLLLGAALFYFLLSWVTKF 455 

Qy 502 LCGKLLGMAVW 512 

:| :: ::| 
Db 456 ICRRI — RSLW 464 



RESULT 15 
Q9R110 

ID Q9R110 PRELIMINARY; PRT; 530 AA. 

AC Q9R110; 

DT 01-MAY-2000 (TrEMBLrel. 13, Created) 

DT 01-MAY-2000 (TrEMBLrel. 13, Last sequence update) 

DT 01-OCT-2003 (TrEMBLrel. 25, Last annotation update) 

DE UDP glucuronosyltransf erase UGT2A3 . 

OS Cavia porcellus (Guinea pig) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Hystricognathi ; Caviidae; Cavia. 

OX NCBI_TaxID=10141; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Hartley; TISSUE=Liver ; 

RX MEDLINE=994 10697; PubMed=104794 84 ; 

RA Smith S.A. , Nagalla S.R., Andrews D.P., Olsen G.D.; 

RT "Morphine regulation of a novel uridine diphosphate glucuronosyl- 

RT transferase in guinea pig pups following in utero exposure."; 

RL Mol. Genet. Metab. 68:68-77(1999). 

DR EMBL; AF175221; AAD51732.1; -. 

DR GO; GO: 0016758; F: trans f erase activity, transferring hexosyl . . .; IEA. 

DR GO; GO: 0016740; F: transferase activity; IEA. 

DR GO; GO: 0008152; P : metabolism; IEA. 

DR InterPro; IPR002213; UDP_gluco_trans . 



DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase. 

SQ SEQUENCE 530 AA; 59895 MW; EE2F394D3FD484E1 CRC64; 

Query Match 24.6%; Score 681; DB 11; Length 530; 

Best Local Similarity 32.2%; Pred. No. 2.9e-48; 

Matches 168; Conservative 88; Mismatches 227; Indels 38; Gaps 11; 

Qy 13 LLPGVLLSEAAKILTISTVGG SHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: II I I : I I | | : | : : : | I I I I : I 

Db 1 MAPGKLASAVLLLLLCCAGSGFCGKVLVWPCEMSHWLNLKTLLEELVKRGHEVTVLTLSN 60 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETL GGRGKFENL 110 

I : : : : : I I II : : I : I : I : : 

Db 61 NLFIDYNRHPAFNFEVIP— VPTDKNMS-ENILNEFIELAVNVMPTMPLWQSGKLLQQFF 117 

Qy 111 LNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSF 170 

: : I I I I : : : : I I : : : I : : : : I I : I I I I I I I : I I 

Db 118 VQITEDLGLNCRNTVYNQSLMKKLRDSKYDVLVTDPVIPCGELVAEMLGVPFVNMLKFSM 177 

Qy 171 G-SLE FGLPIPLSYVPVFRSLLTDHMDFWGRVKN — FLMFFSFCRRQQHMQSTFDNT 224 

I : : I II I I I I I I I I I I I I I I I : I I I I : : I 

Db 178 GHTIEKYCGQLPAPPSYVPVPLGGLTTRMTFMERVKNMVFSVLFDFW-IQQYDYKFWDQF 236 

Qy 225 IKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLE 284 

I I I : : I I I : I I : : I : I I I I I I : I I I I I I I I : I : : : I 

Db 237 YSEAL — GRPTTLCEIMGKAEIWLIRTYWDFEFPRPYLPNFEFVGGLHCKPAKPLPKEME 294 

Qy 2 85 NFIAKFGDSGFVLWLGSMVlJTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDv^IAA 344 

I : I : I I : : I I I I I I : : I I : I I I : I : : I I 

Db 2 95 EFVQSSGEDGVWFSLGSMVl^NL-TEEKANLIASALAQIPQKVLWRYK GKKPATLGP 350 

Qy 345 hT^KIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVETVK 404 

I : : I I : I I : I I I I I : I : I I I I I I I I I I I I I I I I : I : I I I I : I : : : I I 
Db 351 NTRLFDWIPQNDLLGHPKTKAFITHGGSNGIYEAIYHGVPMVGMPIFSDQPDNLAGMKAK 410 

Qy 4 05 KFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQ 464 

I : : : : I : : : : I I I I : I I I : I I I : : I : 

Db 411 GAAVEWMNTMTSADLLGALRTVINDPTYKENAMKLSRIHHDQPVKPLDR^EWEFVMH 470 

Qy 4 65 TGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

II II: I : : II III . : I I 

Db 471 HKGAKHLRVAAHDLSWFQYHSLDVIGFLLACVASAILLVTK 511 
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ALIGNMENTS 



RESULT 1 
UDCl^RABIT 

ID UDC1_RABIT STANDARD; PRT; 502 AA. 

AC P36514; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE UDP-glucuronosyltransf erase 2C1 microsomal (EC 2.4.1.17) (UDPGT) 

DE (Fragment) . 

GN UGT2C1 OR UGT2A2 . 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID-9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN^New Zealand white; TISSUE=Liver ; 

RX MEDLINE=93315511; PubMed=8325897 ; 

RA Tukey R.H., Pendurthi U.R., Nguyen N.T., Green M.D., Tephly T.R.; 

RT "Cloning and characterization of rabbit liver UDP- 

RT glucuronosyltransf erase cDNAs . Developmental and inducible expression 

RT of 4-hydroxybiphenyl UGT2B13 . " ; 

RL J. Biol. Chem. 268:15260-152 66(1993). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 



CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L01083; AAA18023.1; -. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF002 01; UDPGT; 1. 



DR 


PROSITE; 


PS00375; 


UDPGT; 1. 






KW 


Transferase; Glycosyltransf erase; Glycoprotein; 


Transmembrane ; 


KW 


Multigene 


family; 


Microsome . 






FT 


NON_TER 


1 


1 






FT 


TRANSMEM 


466 


481 


POTENTIAL. 




FT 


CARBOHYD 


177 


177 


N-LINKED (GLCNAC. , 


. .) (POTENTIAL). 


FT 


CARBOHYD 


288 


288 


N-LINKED (GLCNAC. . 


. .) (POTENTIAL) . 


SQ 


SEQUENCE 


502 AA; 


57449 MW; 


B6E65670BFAE1D35 


CRC64; 



Query Match 27.0%; Score 748.5; DB 1; 

Best Local Similarity 35.4%; Pred. No. 3.4e-51; 
Matches 180; Conservative 82; Mismatches 178; 



Length 502; 
Indels 69; 



Gaps 



Qy 


34 


Db 


7 


Qy 


72 


Db 


67 


Qy 


128 


Db 


107 


Qy 


184 


Db 


167 


Qy 


241 


Db 


223 


Qy 


301 


Db 


283 



1 1 



II I I : I 



-MLNHKRGPFMPDFKK EE 

: I : I : I I : : II 



14; 
71 



-KSYQV — ISWLAPEDHQREFKKSFDFFLEETLGGRGKFENLLNVLEYLALQCSHFLNR 127 
I ::: : I I III : I I I I : 



I : : : I II 



I I : 



I hill III 



'SFGSLE FGLPIPLSY 183 

II:: I I I I I I 



VPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQH-MQSTFDNTIKEHFTE — GSRPVLSHL 240 
II lllhll I : I I : I : : I I I : I I : : : : II : : 
VPGSTSGLTDNMSFVQRLKNWLLYLMNDMMFSHFMLSEWD EYYSKVLGRRTTICEI 222 

LLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTL 300 

: I I I : I I I : I : I I I I I I I I I I I I I I I : I : : I I I : I : I I : I I 

MGKAEMWLIRSYWDFEFPRPFLPNFEYVGGLHCKPAKPLPEELEEFVQSSGNDGVWFTL 282 

GSMVNTCQNPEIFKEMNN AFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSD 35-6 

III: II : : I : I I I : I I I : I : I I I :: : I : I I : I 

GSMI QN— LTEERSNLIASALAQIPQKVLWRYT GKKPATLGPNTRLFEWI PQND 334 



QY 



357 LLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLK 416 
II II I hllll I : III I I I I I I I I I I I I I I I : I : I I : I I I : I : : 



Db 335 LLGHPKTRAFITHGGTNGLYEAIYHGVPMVGIPLFGDQPDNIARVKAKGAAVDVDLRIMT 394 



Qy 417 AETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVF 476 

: I : I : : : II I : II III I I I I : I : : I I I I : 

Db 395 TSSLLKALKDVINNPSYKENAMKLSRIHHDQPLKPLDRAVFWIEFVMRHKGARHLRV7VAH 454 

Qy 477 QQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I : I I I I I I I : : I I 

Db 455 DLTWFQYYSLDVWFLLTCVATIIFLAKK 483 

RESULT 2 
CGT_HUMAN 

ID CGT_HUMAN STANDARD; PRT; 541 AA. 

AC Q16880; 000196; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE 2-hydroxyacylsphingosine 1-beta-galactosyltransf erase precursor 

DE (EC 2.4.1.45) (UDP-galactose-ceramide galactosyltransf erase) (Ceramide 

DE UDP-galactosyltransferase) (Cerebroside synthase) . 

GN UGT8 OR CGT OR UGT4 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=96299661; PubMed-8 661025 ; 

RA Bosio A., Binczek E., Lebeau M.M., Fernald A. A. , Stoffel W.; 

RT "The human gene CGT encoding the UDP-galactose ceramide galactosyl 

RT transferase (cerebroside synthase) : cloning, characterization, and 

RT assignment to human chromosome 4, band q26. lf ; 

RL Genomics 34:69-75(1996). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=97242209; PubMed-9125199 ; 

RA Kapitonov D.E., Yu R.K.; 

RT "Cloning, characterization, and expression of human ceramide 

RT galactosyltransferase cDNA. " ; 

RL Biochem. Biophys . Res. Commun. 232:44 9-453(1997). 

CC -!- FUNCTION: CATALYZES THE TRANSFER OF GALACTOSE TO CERAMIDE, A KEY 
CC ENZYMATIC STEP IN THE BIOSYNTHESIS OF GALACTOCEREBRO SIDES, WHICH 

CC ARE ABUNDANT SPHINGOLIPIDS OF THE MYELIN MEMBRANE OF THE CENTRAL 

CC NERVOUS SYSTEM AND PERIPHERAL NERVOUS SYSTEM. 

CC -!- CATALYTIC ACTIVITY: UDP-galactose + 2- (2-hydroxyacyl ) sphingosine - 
CC UDP + 1- (beta-D-galactosyl) -2- (2-hydroxyacyl) sphingosine. 

CC -!- PATHWAY: Galactocerebroside biosynthesis. 

CC .-!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC — 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 



EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 
EMBL; 



U30930; AAC50565.1; 
U32370; AAC50815.1; 
U31353; AAC50815.1; 

AAC50815.1; 
AAC50815.1; 
AAC50815.1; 
AAC51187.1; 



JOINED. 
JOINED. 
JOINED. 
JOINED. 



U31461; 
U31658; 
U31861; 
U62899; 
Genew; HGNC: 12555; UGT8 . 
MIM; 601291; 

GO; GO:0008489; F: UDP-galactose-glucosylceramide beta-1, 4-gal . 
GO; GO: 0007417; P: central nervous system development; TAS. 
GO; GO:0007422; Prperipheral nervous system development; TAS. 
InterPro; IPR002213; UDP_gluco_trans . 
Pfam; PF00201; UDPGT; 1. 



TAS, 



DR 


PROSITE; 


PS00375; 


UDPGT; 1. 




KW 


Transferase; Glycc 


syltransferase; Glycoprotein; Transmembrane; Signal; 


KW 


Microsome 








FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


CHAIN 


21 


541 


2-HYDROXYACYLSPHINGOSINE 1-BETA- 


FT 








GALACTO SYLTRANSFERASE . 


FT 


TRANS MEM 


472 


492 


POTENTIAL. 


FT 


CARBOHYD 


78 


78 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


333 


333 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


442 


442 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CONFLICT 


99 


99 


T -> P (IN REF. 2) . 


FT 


CONFLICT 


116 


116 


L -> M (IN REF. 2) . 


FT 


CONFLICT 


356 


356 


L -> V (IN REF. 2) . 


FT 


CONFLICT 


379 


379 


L -> V (IN REF. 2) . 


SQ 


SEQUENCE 


541 AA; 


61455 MW 


; EC532798F7E15834 CRC64; 


Query Match 




25.3%; 


Score 699; DB 1; Length 541; 



Best Local Similarity 32.2%; Pred. No. 2.9e-47; 
Matches 166; Conservative 95; Mismatches 185; Indels 



70; Gaps 12; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 



13 LLPGVLLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I I : :: I I I I : : : II : : : I : I I : I : I : | 
11 LWSAVGIAKAAKI I IVPPIMFESHMYI FKTLASALHERGHHTVFLLSEGRD 61 

71 EKSYQVI SWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : I III:: II I : : I : : 

62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAIELFDILDHYTK 112 

120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : I I I I I : : : I : I I I : I I I I : : : | | | 
113 NCDLMVGNHALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

180 PLSYVPVFRSLLTDHMDFWGRVKN FLMFFSFCRRQQHM QSTFD 222 

I I : I I I I I I I I I I : I : I I ! I : : | | : | : | 

173 PLAYVPEFNSLLTDRMNLLQRMKNTGVYLISRLGVSFLVLPKYERIMQKYNLLPEKSMYD 232 

223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
233 LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

283 LENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFMLPQGVIWKCQCSHWPKDVHL 342 
I : : : : I I I I I : I : I : : I : : I III I I I : | | : | 



Db 276 LQRWVNGANEHGFVLVS FGAGVKYL- S EDIANKLAGALGRLPQKVIWRFS GPKPKNL 331 

Qy 343 AANVKIVTWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVP^ 402 

I I : : : I I I I : I I I I I : I : : I I I I I I I : I I I I : I I I I I I I I : I M : 

Db 332 GNNTKLIEWLPQNDLLGHSKIKAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQ 391 

Qy 403 AKKFGVS IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHV 462 

I I I : : : I : : I : : : : : I : I I I : I I I : I I I : : 

Db 392 AKGMGILLEWKTVTEKELYEALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTIYWIDYI 451 

Qy 4 63 LQTGGATHLKPYVFQQPWHEQYLFDV- FVFLLGLTL 497 

: : llll: II : : : I I : I I I I I I 

Db ■ 452 IRHNGAHHLRAAVHQISFCQYFLLDIAFVLLLGAAL 487 

RESULT 3 
CGT_MOUSE 

ID CGT_MOUSE STANDARD; PRT; 541 AA. 

AC Q64676; Q61634; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE 2-hydroxyacylsphingosine 1-beta-galactosyltransf erase ..precursor 

DE (EC 2.4.1.45) (UDP-galactose-ceramide galactosyltrans f erase) (Ceramide 

DE UDP-galactosyltransf erase) (Cerebroside synthase) . 

GN UGT8 OR CGT OR UGT4 . 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxI D=l 0 090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=BALB/c; 

RA Bosio A. , Binczek E., Stoffel W. ; 

RL Submitted (APR-1996) to the EMBL/ GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Coetzee T., Li X., Fujita N., Marcus J., Suzuki K., Francke U., 

RA Popko B. ; 

RL Submitted (FEB-1996) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: CATALYZES THE TRANSFER OF GALACTOSE TO CERAMIDE, A KEY 

CC ENZYMATIC STEP IN THE BIOSYNTHESIS OF GALACTOCEREBROSIDES, WHICH 

CC ARE ABUNDANT SPHINGOLIPIDS OF THE MYELIN MEMBRANE OF THE CENTRAL 

CC NERVOUS SYSTEM AND PERIPHERAL NERVOUS SYSTEM. 

CC -!- CATALYTIC ACTIVITY: UDP-galactose + 2- ( 2-hydroxyacyl ) sphingosine = 

CC UDP + 1- (beta-D-galactosyl) -2- (2-hydroxyacyl) sphingosine. 

CC -!- PATHWAY: Galactocerebroside biosynthesis. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltrans f erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation ~ 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC ■ 



DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
FT 
FT 
FT 



EMBL; X92122; CAA63090.1; 

EMBL; X92123; CAA63091.1; 

EMBL; X92124; CAA63091.1; JOINED. 

EMBL; X92125; CAA63091.1; JOINED . 

EMBL; X92126; CAA63091.1; JOINED. 

EMBL; X92177; CAA63091.1; JOINED. 

EMBL; U48896; AAC53576.1; 

EMBL; U48892; AAC53576.1; JOINED. 

EMBL; U48893; AAC53576.1; JOINED. 

EMBL; U48894; AAC53576.1; JOINED. 

MGD; MGI: 109522; Ugt8 . 

InterPro; IPR002213; UDP_gluco_trans . 
Pfam; PF00201; UDPGT; 1. 
PROSITE; PS00375; UDPGT; 1. 

Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal ; 
Microsome. 

SIGNAL 1 2 0 POTENTIAL. 

CHAIN 21 541 2-HYDROXYACYLSPHINGOSINE 1-BETA- 

GALACT O SYLT RAN SFE RASE. 



FT 


TRANSMEM 


472 


492 


POTENTIAL. 




FT 


CARBOHYD 


78 


78 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


333 


333 


N-LINKED (GLCNAC. . 


.) (POTENTIAL) 


FT 


CARBOHYD 


442 


442 


N-LINKED (GLCNAC. . 


. ) (POTENTIAL) 


FT 


CONFLICT 


335 


335 


T -> S (IN REF. 2) . 




FT 


CONFLICT 


458 


458 


H -> R (IN REF. 2) . 




FT 


CONFLICT 


511 


511 


K -> E (IN REF. 2) . 




FT 


CONFLICT 


524 


524 


P -> R (IN REF. 2) . 




FT 


CONFLICT 


533 


534 


HI -> RV (IN REF. 2) . 


FT 


CONFLICT 


541 


541 


K -> R (IN REF. 2) . 




SQ 


SEQUENCE 


541 AA; 


61137 MW; B76F80A9B5326EE8 


CRC64; 


Query Match 




25.2%; 


Score 697; DB 1; 


Length 541; 


Best Local Similarity 


32.4%; 


Pred. No. 4.2e-47; 




Matches 167; 


Conservative 


95; Mismatches 184; 


Indels 70; 



Gaps 12; 



Qy 

Db 



13 LLPGVTiLSEAAKI LTI STV-GGSHYLLMDRVSQI LQDHGHN-VTMLNHKRGPFMPDFKKE 70 

I I : : I I I I : : : II : : : I : I I : I : I : I 
11 LWSAVGIARAAKIIIVPPIMFESHLYIFKTLASALHERGHHTVLLLSEGRD 61 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 



71 EKSYQVISWLAPEDHQ— REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I I : I : : I III:: II | : : : | : : 

62 IAPSNHYSLQRYPGIFNSTTSDAFLQSKMRNIFSGRLTAVELVDILDHYTK 112 

12 0 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 17 9 

I : : :: II I I I : : : I : I I |:| I II : : : III 

113 NCDMMVGNQALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 



180 



222 



PLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQH MQSTFD 

I I : I I I I I I I I I I : I I : I I I I : : I I : I : I 

173 PLAYVPEFNSLLTDRMNFLERMKNTGVYLISRIGVSFLVLPKYERIMQKYNLLPAKSMYD 232 

223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
233 LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

283 LENFIAKFGDSGFVXVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 
I : : : : : I I I I I : I : I : : I : : I III III: : I : I 



Db 



276 LQRWSGAQEHGFVLVSFGAGVKYL-SEDIANKIAGALGRLPQKVIWRFSGT KPKNL 331 



Qy 343 AANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

I I ::: I I I I : I I I I : I I I :: I I I I I I I : I I I I : I I I I I I I I : I I I : 

Db 332 GNNTKLI EWLPQNDLLGHSNI RAFLSHGGLNS I FETMYHGVPWGI PLFGDHYDTMTRVQ 391 

Qy 4 03 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKS7VAVAASVILRSHPLSPTQRLVGWIDHV 4 62 

III::: : I : : : : : I : I I I : I I I I I I ' : 
Db 392 AKGMGILLEWNTVTEGELYDALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTTYWIDYI 451 

Qy 4 63 LQTGGATHLKPYVFQQPWHEQYLFDV- FVFLLGLTL 497 

I : II II: II : : : I I : I I I I I I 

Db 452 LRHDGAHHLRSAVHQISFCQYFLLDIAFVLLLGAVL 487 



RESULT 4 
CGT RAT 



ID CGT_RAT STANDARD; PRT; 541 AA. 

AC Q09426; 

DT 01-NOV-1995 (Rel. 32, Created) 

DT 01-NOV-1995 (Rel. 32, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE 2^-hydroxyacylsphingosine l-.beta-galactosyltransf erase precursor 

DE (EC 2.4.1.45) (UDP-galactose-ceramide galactosyltransf erase ) (Ceramide 

DE UDP-galactosyltransferase) (Cerebroside synthase) . 

GN UGT8 OR CGT OR UGT4 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Wistar; TISSUE=Brain; 

RX MEDLINE-94052143; PubMed-7694285 ; 

RA Schulte S., Stoffel W. ; 

RT "Ceramide UDPgalactosyltransf erase from myelinating rat brain: 

RT purification, cloning, and expression."; 

RL Proc. Natl. Acad. Sci. U.S.A. 90:10265-10269(1993). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Sprague-Dawley; TISSUE=Brain; 

RX MEDLINE=94358923; PubMed=7521399 ; 

RA Stahl N . , Jurevics H. , Morell P., Suzuki K., Popko B.; 

RT "Isolation, characterization, and expression of cDNA clones that 

RT encode rat UDP-galactose : ceramide galactosyltransf erase ." ; 

RL J. Neurosci. Res. 38:234-242(1994). 

CC -!- FUNCTION: CATALYZES THE TRANSFER OF GALACTOSE TO CERAMIDE, A KEY 
CC ENZYMATIC STEP IN THE BIOSYNTHESIS OF GALACTOCEREBROSIDES , WHICH 

CC ARE ABUNDANT SPHINGOLIPIDS OF THE MYELIN MEMBRANE OF THE CENTRAL 

CC NERVOUS SYSTEM AND PERIPHERAL NERVOUS SYSTEM. 

CC -!- CATALYTIC ACTIVITY: UDP-galactose + 2- ( 2-hydroxyacyl ) sphingosine = 
CC UDP + 1- (beta-D-galactosyl) -2- (2-hydroxyacyl) sphingosine. 

CC -!- PATHWAY: Galactocerebroside biosynthesis. 

CC -!- TISSUE SPECIFICITY: BRAIN, RESTRICTED TO THE OLIGODENDROCYTE- 

CC CONTAINING CELL LAYERS OF CEREBRUM AND CEREBELLUM. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 



CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L21698; AAA16108.1; -. 

DR EMBL; U07683; AAA50212.1; -. 

DR PIR; A48801; A48801. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Microsome. 



FT 


SIGNAL 


1 


20 


POTENTIAL. 


FT 


CHAIN 


21 


541 


2-HYDR0XYACYLSPHING0SINE 1-BETA- 


FT 








GALACTOSYLTRANSFERASE . 


FT 


TRANSMEM 


472 


492 


POTENTIAL. 


FT 


CARBOHYD 


78 


78 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


333 


333 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


CARBOHYD 


442 


442 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


SQ 


SEQUENCE 


541 AA; 


61126 


MW; 260D7603170151BB CRC64; 


Query Match 




24.7^ 


h; Score 684; DB 1; Length 541; 



Best Local Similarity 32.2%; Pred. No. 4.4e-46; 

Matches 165; Conservative 94; Mismatches 184; Indels 70; Gaps 12; 

Qy 13 LLPGVLLSEAAKILTISTV-GGSHYLLMDRVSQILQDHGHN-VTMLNHKRGPFMPDFKKE 7 0 

I | : : | | | | : : : II : : : I : I I : I : I : I 
Db 11 LWSAVGIARAAKIIIVPPIMFESHLYIFKTLASALHERGHHTVFLLSEGRD 61 

Qy 71 EKSYQVISWLAPEDHQ — REFKKSF DFFLEETL GGRGKFENLLNVLEYLAL 119 

: I : I : : I III:: II | : : : | : : 

Db 62 IDPSNHYSLQRYPGI FNSTTSDAFLQSKMRNI FSGRLTAVELVDILDHYTK 112 

Qy 120 QCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEFGLPI 179 

I : : : : I I I I I : :: I : I I hi II : : : I I I 
Db 113 NCDMMVGNQALIQGLKKEKFDLLLVDPNDMCGFVIAHLLGVKYAVFSTGLWYPAEVGAPA 172 

Qy 180 PLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQH MQSTFD 222 

I I : I I I I I I I II I : I I : I I II: : I I : I : I 

Db 173 PLAYVPEFNSLLTDRMNFLERMKNTGVYLISRMGVS FLVLPKYERIMQKYNLLPAKSMYD 232 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : I I : : I I : I I I I I I I I I I I : : I I I : I : I 
Db 233 LVHGSSLWMLCTDVALEFPRPTLPNWYVGGILTKPASPLPED 275 

Qy 283 LENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHL 342 

I : : : : I I I I I : I : I : : I : : I III III: : I : I 
Db 27 6 LQRWVDGAQEHGFVLVSFGAGVKYL-SEDIANKLAGALGRLPQKVIWRFSGT KPKNL 331 

Qy 34 3 AANWIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

I I ::: I I I I : II I I : I I I :: I I I I I I I : I I II : I I I I I II I : I I I : 
Db 332 GNNTKLIEWLPQNDLLGHSNIRAFLSHGGLNSIFETMYHGVPWGIPLFGDHYDTMTRVQ 391 



Qy 403 AKKFGVS IQLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHPLS PTQRLVGWI DHV 462 

II I : : : : I : : : : : | : I | | : I I I III — 
Db 392 AKGMGILLEWNTVTEGELYDALVKVINNPSYRQRAQKLSEIHKDQPGHPVNRTTYWIDYI 451 

Qy 463 LQTGGATHLKP YVFQQPWHEQYLFDV- FVFLLG 4 94 

I : II II: II : : : I I : I I I I I 
Db 452 LRHDGAHHLRSAVHQI S FCQYFLLDI AFVLLLG 4 84 



RESULT 5 
UD11_HUMAN 

ID UD11_HUMAN STANDARD; PRT; 533 AA. 

AC P22309; 

DT 01-AUG-1991 (Rel. 19, Created) 

DT 01-AUG-1991 (Rel. 19, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE UDP-glucuronosyltransf erase 1-1 precursor, microsomal (EC 2.4.1.17) 

DE (UDP-glucuronosyltransferase 1A1) (UDPGT) (UGT1*1) (UGT1-01) (UGT1.1) 

DE (UGT-1A) (UGT1A) (Bilirubin specific UDPGT isozyme 1) (HUG-BR1) . 

GN UGT1A1 OR UGTl OR GNT1. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=91093210; PubMed=1898728 ; 

RA Ritter J.K., Crawford J.M., Owens I.S.; 

RT "Cloning of two human liver bilirubin UDP-glucuronosyltransferase 

RT cDNAs with expression in COS-1 cells."; 

RL J. Biol. Chem. 266:1043-1047(1991). 

RN [2] 

RP SEQUENCE FROM N.A., AND TISSUE SPECIFICITY. 

RX MEDLINE=92147680; PubMed=133 944 8 ; 

RA Ritter J.K., Chen F. , Sheen Y.Y., Tran H.M., Kimura S., Yeatman M.T., 

RA Owens I . S . ; 

RT "A novel complex locus UGTl encodes human bilirubin, phenol, and 

RT other UDP-glucuronosyltransferase isozymes with identical carboxyl 

RT termini."; 

RL J. Biol. Chem. 267:3257-3261(1992). 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE-21327373; PubMed=11434514 ; 

RA Gong Q.H., Cho J.W., Huang T., Potter C, Gholami N., Basu N.K., 

RA Kubota S., Carvalho S., Pennington M.W., Owens I.S., Popescu N.C.; 

RT "Thirteen UDP-glucuronosyltransferase genes are encoded at the human 

RT UGTl gene complex locus."; 

RL Pharmacogenetics 11:357-368(2001). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Gattung S., Stoneking T., Davidson T.; 

RL Submitted (MAR-1999) to the EMBL/ GenBank/DDB J databases. 

RN [5] 

RP SEQUENCE OF 1-50 FROM N.A. 

RA Ueyama H., Koiwai O., Soeda Y., Sato H., Satoh Y., Ohkubo I., 

RA Doida Y. ; 



RT "Analysis of the promoter of human bilirubin 

RT UDP-glucuronosyltransf erase gene (UGT1*1) in relevance to Gilbert's 

RT syndrome."; 

RL Hepatol. Res. 9:152-163(1997). 

RN [6] 

RP VARIANT CN-I PHE-375. 

RX MEDLINE-92339803; PubMed=1634 050 ; 

RA Bosma P.J., Chowdhury J.R., Huang T.-J., Lahiri P., Elferink R.P.J.O., 

RA van Es H.H.G., Lederstein M. , Whitington P.F., Jansen P.L.M., 

RA Chowdhury N.R.; 

RT "Mechanisms of inherited deficiencies of multiple UDP- 

RT glucuronosyltransf erase isoforms in two patients with Crigler-Na j j ar 

RT syndrome, type I."; 

RL FASEB J. 6:2859-2863(1992). 

RN [7] 

RP VARIANTS CN-II ARG-71 AND ASP-486. 

RX MEDLINE=94107323; PubMed=8280139 ; 

RA Aono S., Yaraada Y., Keino H., Hanada N., Nakagawa T., Sasaoka Y., 

RA Yazawa T., Sato H., Koiwai O.; 

RT "Identification of defect in the genes for bilirubin UDP-glucuronosyl- 

RT transferase in a patient with Crigler-Na j j ar syndrome type II."; 

RL Biochem. Biophys . Res. Commun. 197:1239-124 4(1993). 

RN [8] 

RP VARIANT CN-II ARG-331. 

RX MEDLINE=94102756; PubMed=82764 13 ; f 

RA Moghrabi N., Clarke D.J., Boxer M. , Burchell B. ; 

RT "Identification of an A-to-G missense mutation in exon 2 of the UGT1 

RT gene complex that causes Crigler-Naj jar syndrome type 2."; 

RL Genomics 18:171-173(1993). 

RN [9] 

RP VARIANT CN-I PHE-170 DEL. 

RX MEDLINE=94043159; PubMed=8226884 ; 

RA Ritter J.K., Yeatman M.T., Kaiser C. , Gridelli B., Owens I.S.; 

RT "A phenylalanine codon deletion at the UGT1 gene complex locus of a 

RT Crigler-Na j jar type I patient generates a pH-sensitive bilirubin UDP- 

RT glucuronosyltransf erase ." ; 

RL J. Biol. Chem. 268:23573-23579(1993). 

RN [10] 

RP VARIANTS CN-I VAL-292; GLU-308; ARG-357; THR-368; ARG-381; PRO-401 AND 

RP GLU-42 8. 

RX MEDLINE=95080780; PubMed=7989045; 

RA Labrune P., Myara A. , Hadchouel M., Ronchi F. f Bernard O., Trivin F. , 

RA Roy Chowdhury N. , Roy Chowdhury J., Munnich A. , Odievre M. ; 

RT "Genetic heterogeneity of Crigler-Na j jar syndrome type I: a study of 

RT 14 cases."; 

RL Hum. Genet. 94:693-697(1994). 

RN [11] 

RP VARIANTS CN GLU-175; ARG-177; TRP-209; ARG-276 AND PHE-375. 

RX MEDLINE=95081424; PubMed=7989595 ; 

RA Seppen J., Bosma P.J., Goldhoorn B . G . , Bakker C.T.M., 

RA Roy Chowdhury J. , Roy Chowdhury N., Jansen P.L.M., 

RA Oude Elferink R.P.J.; 

RT "Discrimination between Crigler-Na j j ar type I and II by expression of 

RT mutant bilirubin uridine diphosphate-glucuronosyltransf erase ." ; 

RL J. Clin. Invest. 94:2385-2391(1994). 

RN [12] 

RP VARIANTS GILBERT SYNDROME ARG-71; GLN-229 AND GLY-367. 



RC TISSUE=Liver, and Peripheral blood leukocytes; 

RX MEDLINE=95231122; PubMed=77 152 97 ; 

RA Aono S., Adachi Y. , Uyama E., Yamada Y., Keino H. , Nanno T . , 

RA Koiwai 0., Sato H. ; 

RT "Analysis of genes for bilirubin UDP-glucuronosyltransf erase in 

RT Gilbert 1 s syndrome . " ; 

RL Lancet 345:958-959(1995). 

RN [13] 

RP VARIANTS CN II ARG-71; TRP-209; GLN-229 AND ASP-486. 

RX MEDLINE=9 8 2 84535; PubMed= 9621515; 

RA Yamaraoto K. , Soeda Y., Kamisako T., Hosaka H., Fukano M. , Sato H. , 

RA Fujiyama Y., Dachi Y., Satoh Y., Bamba T . ; 

RT "Analysis of bilirubin uridine 5 1 -diphosphate (UDP)- 

RT glucuronosyltransferase gene mutations in seven patients with Crigler- 

RT Najjar syndrome type II."; 

RL J. Hum. Genet. 43:111-114(1998). 

RN [14] 

RP VARIANT GILBERT SYNDROME ASP-4 86. 

RX MEDLINE=98291073; PubMed=9627 603 ; 

RA Maruo Y., Sato H. , Yamano T., Doida Y. , Shimada M. ; 

RT "Gilbert syndrome caused by a homozygous missense mutation (Tyr486Asp) 

RT of bilirubin UDP-glucuronosyltransf erase gene."; 

RL J. Pediatr. 132:1045-1047(1998). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 

CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds. This isoform glucuronidates bilirubin IX- 

CC alpha to form both the IX-alpha-C8 and IX-alpha-C12 monocon jugates 

CC and diconjugate. 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside. 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=l; 

CC Comment=A number of isoforms are produced. The different 

CC isozymes have a different N-terminal domain and a common 

CC C-terminal domain of 245 residues; 

CC Name^l; 

CC IsoId=P22309-l; Sequence=Displayed; 

CC -!- TISSUE SPECIFICITY: Expressed in liver. Not expressed in skin or 
CC kidney. 

CC -!- DISEASE: THE GILBERT'S SYNDROME IS SHOWN TO OCCUR AS A CONSEQUENCE 
CC OF REDUCED BILIRUBIN TRANSFERASE ACTIVITY. THE DISORDER, IS MOST 

CC OFTEN DETECTED IN YOUNG ADULTS WITH VAGUE NONSPECIFIC COMPLAINTS. 

CC A MORE SEVERE INHERITABLE DEFICIENCY IN BILIRUBIN ACTIVITY EXIST 

CC IN CRIGLER-NAJJAR (CN) : PATIENTS WITH TYPE I (RECESSIVE TRAIT) 

CC HAVE SEVERE HYPERBILIRUBINEMIA AND USUALLY DIE OF KERNICTERUS 

CC (BILIRUBIN ACCUMULATION IN THE BASAL GANGLIA AND BRAINSTEM NUCLEI) 

CC WITHIN THE FIRST YEAR OF LIFE. PATIENTS WITH TYPE II (DOMINANT 

CC TRAIT) HAVE LESS SEVERE HYPERBILIRUBINEMIA AND USUALLY SURVIVE 

CC INTO ADULTHOOD WITHOUT NEUROLOGIC DAMAGE . PHENOBARBITAL, WHICH 

CC INDUCES THE PARTIALLY DEFICIENT GLUCURONYL TRANSFERASE, CAN 

CC DIMINISH THE JAUNDICE. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltrans f erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 



cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 



use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 

EMBL; M84125; AAA61248.1; -. 
EMBL; M84124; AAA61247.1; ALT_SEQ. 
EMBL; M84122; AAA61247.1; JOINED. 
EMBL; M84123; AAA61247.1; JOINED. 
EMBL; M57899; AAA63195.1; -. 
EMBL; AF297093; AAG30424.1; 
EMBL; AC006985; AAF03522.1; 
EMBL; D87674; BAA25600.1; -. 
PIR; A39092; A39092. 
Genew; HGNC: 12530; UGT1A1. 
MIM; 191740; 
MIM; 143500; -. 
MIM; 218800; 
MIM; 606785; -. 

GO; GO: 0006789; P:bilirubin conjugation; TAS . 
GO; GO:0008210; P:estrogen metabolism; TAS. 
InterPro; IPR002213; UDP_gluco_trans . 
Pfam; PF00201; UDPGT; 1. 
PROSITE; PS00375; UDPGT; 1. 

Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 



KW 


Multigene 


family; 


Microsome; 


Alternative splicing; Disease mutation. 


FT 


SIGNAL 


1 


25 


POTENTIAL. 


FT 


CHAIN 


26 


533 


UDP-GLUCURONOSYLTRANSFERASE 1-1. 


FT 


TRANSMEM 


491 


507 


POTENTIAL. 


FT 


CARBOHYD 


102 


102 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


295 


295 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


FT 


CARBOHYD 


347 


347 


N-LINKED (GLCNAC. . .) (POTENTIAL) . 


FT 


VARIANT 


71 


71 


G -> R (IN CRIGLER-NAJJAR TYPE II AND 


FT 








GILBERT SYNDROME) . 


FT 








/FTId=VAR 009504. 


FT 


VARIANT 


170 


170 


MISSING (IN CRIGLER-NAJJAR TYPE I; HAS 


FT 








NEARLY NORMAL ACTIVITY AT PH 7.6 AND IS 


FT 








INACTIVE AT PH 6.4) . 


FT 








/FTId=VAR_007695. 


FT 


VARIANT 


175 


175 


L -> E (IN CRIGLER-NAJJAR TYPE II). 



Query Match 24.5%; Score 679; DB 1; Length 533; 

Best Local Similarity 33.8%; Pred. No. l.le-45; 

Matches 175; Conservative 93; Mismatches 211; Indels 38; 



Gaps 13; 



Qy 

Db 



8 LLVGFLL — PGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 
I : : I I I I :: I I I I I I I I I I : I I III I I : : I I 
11 LVLGLLLCVLGPWSHAGKI LLI - PVDGSHWLSMLGAIQQLQQRGHEI WL AP 62 



Qy 

Db 



66 D FKKEEKSYQVI SWLAPEDHQRE- FKKSF DFFLEETLGGRGKFENLL 111 

I : : : I : : : I Nihil III:: : : : 

63 DAS L Y I RDGAF YT L KT Y — PVPFQREDVKESFVSLGHNVFENDSFLQRVI KTYKKIK 117 



Qy 

Db 



112 NVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I I I I : I : : I I I : I I : : : : I I : : I : I II I 
118 KDSAMLLSGCSHLLHNKELMASLAESSFDVMLTDPFLPCSPIVAQYLSLPTVFFLHALPC 177 



Qy 172 SLEF GLPIPLSWPVFRSLLTDHMDFWGRVKNFLMFFSFCRRQQHMQSTFDNTIKEH 228 

I I I I I I I I I I I : I I I I I I I I I : I I : I : I : 

Db 178 SLEFEATQCPNPFSYVPRPLSSHSDHMTFLQRVKNMLIAFSQNFLCDWYSPY-ATLASE 236 

Qy 229 FTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIA 288 

I : : II I : I III I : I I : : I I I : I I I : I : I : I : I 

Db 237 FLQ-REVTVQDLLSSASVWLFRSDFVKDYPRPIMPNMVFVGGINCLHQNPLSQEFEAYIN 2 95 

Qy 28 9 KFGDSGFVLVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKI 34 8 

I : I I : : I I I I I : : : : I : I I I : I : : : : I I I : 
Db 296 ASGEHGI WFSLGSMVSEI PEKKAM-AIADALGKI PQTVLWRYTGT RPSNLANNTIL 351 

Qy 349 VDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGV 408 

I I I I I : I I I I I I I : I I I : : I : I : I I I I I : I I I I I I : I hi I II 
Db 352 VKWLPQNDLLGHPMTRAFITHAGSHGWESICNGVPMV1WPLFGDQMDNAKRM 411 

Qy 4 09 S I QLKKLKAETLALKMKQIMEDKRYKSAAVAASVI LRSHP LS PTQRLVGWI DHVLQTGGA 4 68 

II :| :: || II : I : : I : I I |:: |:: II 

Db 412 TLNVLEMTSEDLENALKAVINDKSYKENIMRLSSLHKDRPVEPLDLAVFWVEFVMRHKGA 471 

Qy 469 THLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWLCGK 505 

I I : I I : : : II I I I : I :: I 

Db 472 PHLRPAAHDLTWYQYHS LDVI GFLLAWLTVAFI T FK 508 



RESULT 6 
UDB2__RAT 

ID UDB2_RAT STANDARD; PRT; 530 AA. 

AC P08541; 

DT 01-AUG-1988 (Rel. 08, Created) 

DT 01-AUG-1988 (Rel. 08, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B2 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) (3-hydroxyandrogen specific) (UDPGTR-4) (RLUG23) . 

GN UGT2B2 . 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=87033594; PubMed=2429951 ; 

RA McKenzie P.I. ; 

RT "Rat liver UDP-glucuronosyltrans f erase . cDNA sequence and expression 

RT of a form glucuronidating 3-hydroxyandrogens . " ; 

RL J. Biol. Chem. 261:14112-14117(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=91369480; PubMed=1909872 ; 

RA Haque.S.J., Peterson D.D., Nebert D.W., McKenzie P.I.; 

RT "Isolation, sequence, and developmental expression of rat UGT2B2: the 

RT gene encoding a constitutive UDP glucuronosyltransf erase that 

RT metabolizes etiocholanolone and androsterone . " ; 

RL DNA Cell Biol. 10:515-524(1991). 

RN [3] 

RP SEQUENCE OF 30-530 FROM N.A. 



RC TISSUE-Liver; 

RX MEDLINE=86120371; PubMed=3003696; 

RA Jackson M.R., Burchell B.; 

RT "The full length coding sequence of rat liver androsterone UDP- 

RT glucuronyltransf erase cDNA and comparison with other members of this 

RT gene family."; 

RL Nucleic Acids Res. 14:779-795(1986). 

CC -!- FUNCTION: UDPGT IS OF MAJOR IMPORTANCE IN THE CONJUGATION AND 
CC SUBSEQUENT ELIMINATION OF POTENTIALLY TOXIC XENOBIOTICS AND 

CC ENDOGENOUS COMPOUNDS. 2B2 ACTS ON VARIOUS ENDOGENOUS STEROIDS, 

CC ESPECIALLY ETIOCHOLANOLONE AND ANDROSTERONE. 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- INDUCTION: Constitutively expressed. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC ; 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



CC 








DR 


EMBL; J02589; AAA42314.1; 




DR 


EMBL; M74439; -; NOT_ANNOTATED_CDS . 




DR 


EMBL; X03478; CAA27198.1; 




DR 


PIR; A40467; A40467. 




DR 


InterPro; 


IPR002213; UDP gluco trans. 




DR 


Pfam; PF002 01; UDPGT; 1. 




DR 


PROSITE; 


PS00375; UDPGT; 1. 




KW 


Transferase; Glycosyltransf erase; Glycoprotein; 


Transmembrane ; 


KW 


Multigene 


family; Microsome. 




FT 


SIGNAL 


1 23 BY SIMILARITY . 




FT 


CHAIN 


24 530 UDP- GLUCURONO S YLT RAN S FERAS E 2B2. 


FT 


TRANSMEM 


494 510 POTENTIAL. 




FT 


CARBOHYD 


316 316 N-LINKED (GLCNAC. 


. .) (POTENTIAL) 


FT 


CONFLICT 


159 159 D -> E (IN REF. 3) 




FT 


CONFLICT 


286 286 A -> S (IN REF. 3) 




FT 


CONFLICT 


351 351 N -> I (IN REF. 3) 




FT 


CONFLICT 


363 363 L -> I (IN REF. 3) 




SQ 


SEQUENCE 


530 AA; 60985 MW; F2FFF3E23E2D75B2 


CRC64; 



Query Match 24.5%; Score 677.5; DB 1; Length 530; 

Best Local Similarity 33.5%; Pred. No. 1.4e-45; 

Matches 170; Conservative 87; Mismatches 218; Indels 33; Gaps 13; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVISW-LAPEDHQREFKKS 92 

II::: : I I I I I : I I : I : : : : I : : : : I II 

Db 34 SHWMNIKIILDELVQRGHEVTVLKPSAYFFLDPKKSSDLKFEIFSTSISKDELQNHFIKL 93 

Qy 93 FDFFLEE TLGGRGKFENLLNVLEYLALQ-CSHFLNRKDIMDSLKNENFDMVIVET 146 

I : I I : I I : III : : I : I I : I I : : : 

Db 94 LDVWTYELPRDTCLSYSPILQNLVYEFSYFYLSICKDAVSNKQLMTKLQESKFDVLFADP 153 



Qy 



147 FDYCPFLIAEKLGKPFVAILSTSFG-SLEFGLP 1 PLSYVPVFRSLLTDHMDFWGRVK 202 



I Mill II: Mil II : M MM 

Db 154 VASCGDLIAELLHIPFLYSLSFSPGHKLEKSIGKFILPPSYVPVILSGLAGKMTFIDRVK 213 

Qy 203 NF--LMFFSF-CRRQQHMQ-STFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

I : : : I I I : I : M : I I = : | I : I I I : I 

Db 214 NMI CMLYFDFWFERLRHKEWDT FYS EI L GRPTTVDETMSKVEIWLIRSYWDLKFP 268 

Qy 259 RPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 318 

I Ml I : M I I I I I : I M : I I : I : I I : : I M M : : : I I 

Db 269 HPTLPNVDYIGGLHCKPAKPLPKDMEEFVQSSGEHGVWFSLGSMVS NMTEEKAN 323 

Qy 319 AFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNS 374 

| | : M I : I I I I : : M I I : I I I I I : M M M I 

Db 324 AIAWALAQIPQKVLWKFD GKTPATLGPNTRVYKWLPQNDLLGHPKTKAFVTHGGANG 380 

Qy 375 IMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKK^ 434 

: M I M : M : M I M M M : I : : I I I I : : = = ::::::: II 

Db 381 LYEAIYHGIPMIGIPLFGDQPDNIAHMVAKGAAVSLNIRTMSKLDFLSALEEVIDNPFYK 440 

Qy 435 SAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLG 494 

: || | : | MM:::: II MM I I : : : I I Ml 

Db 441 KNVMLLSTIHHDQPMKPLDRAVFWIEFIMRHKGAKHLRPLGHNLPWYQYHSLDVIGFLLT 500 

Qy 495 LTLGTLWLCGK-LLGMAVWWLRGARKVK 521 

I Mil : : : : MM 
Db 501 CFAVIAALTVKCLLFMYRFFVKKEKKMK 52 8 



RESULT 7 
UDBK MAC FA 



ID UDBK_MACFA STANDARD; PRT; 530 AA. 

AC 077649; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B20 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) . 

GN UGT2B2 0. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. , AND CHARACTERIZATION. 

RC TISSUE=Liver, and Prostate; 

RX MEDLINE=99112924; PubMed=98 95303 ; 

RA Barbier 0., Belanger A., Hum D.W.; 

RT "Cloning and characterization of a simian UDP-glucuronosyltrans f erase 

RT enzyme UGT2B20, a novel C19 steroid-conjugating protein."; 

RL Biochem. J. 337:567-574(1999). 

CC -!- FUNCTION: UDPGTs are of major importance in the conjugation and 

CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds. This isozyme has glucuronidating capacity 

CC with androgens, such as testosterone, dihydrotestosterone (DHT) 

CC and 3alpha-Diol. It is also active on catecholoes trogens including 

CC 1, 3, 5, 10-oestratriene-3, 4-diol-17-one . 



CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb~sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF072223; AAD08808.1; -. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome. 



FT 


SIGNAL 


1 


23 


POTENTIAL . 




FT 


CHAIN 


24 


530 


UDP-GLUCURONOSYLTRANSFERASE 2B20. 


FT 


TRANSMEM 


495 


515 


POTENTIAL. 




FT 


CARBOHYD 


65 


65 


N-LINKED (GLCNAC. 


. .) (POTENTIAL). 


FT 


CARBOHYD 


103 


103 


N-LINKED (GLCNAC. 


. . ) (POTENTIAL) . 


FT 


CARBOHYD 


316 


316 


N-LINKED (GLCNAC. 


. . ) (POTENTIAL) . 


FT 


CARBOHYD 


483 


483 


N-LINKED (GLCNAC. 


. .) (POTENTIAL). 


SQ 


SEQUENCE 


530 AA; 


61225 MW; 


A5EB47F8D517D8DA CRC64; 


Query Match 




24.3%; 


Score 673; DB 1; 


Length 530; 



Best Local Similarity 32.9%; Pred. No. 3.1e-45; 

Matches 160; Conservative 85; Mismatches 218; Indels 24; Gaps 10; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRGPFMPDFKKEEKSYQVI-SWLAPEDHQREFKKS 92 

II:: I : : I t 11*1 I: I I ::| : I I : I 

Db 34 SHWINMKTILEELVRRRHEVTVLTSSASTFVNDSKSSAIKFEVYPTSLTKNDMEDSLMKL 93 

Qy 93 FDFFLEETLGGRGKFENLLNVLEYLALQ CSHFLNRKDIMDSLKNENFDMVIV 14 4 

I : | : : | : | : I : I : I I I I I : : : 

Db 94 LDIWTYS I — SNST FLS YFSKLQELCWEYYYYSEKLCKDAVLNKKLMTKLKETKFDVI LA 151 

Qy 145 ETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFWGR 200 

: : I I : I I II I I : I : I I I I II I I I hi I I I 

Db 152 DALNPCGELLAELFNIPFVYSLRFTVGYTFEKNGGGFLFPPSYVPVVMSELSDQMTFTER 211 

Qy 201 VKNFL — MFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFA 258 

:|| : ::| I II :| I I I : ll|:| I : : |:| 
Db 212 I KNMIHKLYFDFW- FQIHDI KKWDQFYSE — VLGRPTTLFETMRKAEMWLI RTYWDFEFP 2 68 

Qy 259 RPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEMNN 318 

Mill : I I I I | | | I : I : : : I I : I : : I I : : I I I I : : : I : : 

Db 269 RPFLPNVDFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMISN-MSEERANMIAS 327 

Qy 319 AFAHLPQGVIWKCQCSHWPKDVIilAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEA 378 

I I : I I I : I I | | : | : : M I I : I I I I I : I : I I I I I I I I 

Db 328 ALAQI PQKVLWKFD GKKPNTLGSNTRLYKWLPQNDLLGHPKTKAFITHGGTNGIYEA 384 



Qy 



379 IQHGVPMVGIPLFGDQPENMVT^VEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAV 438 



I I I : I I I I I I I I I I : I : I :: I :|: :: : : I :| :: : M l : 
Db 385 IYHGIPMVGIPLFADQHDNIVHMKVKGAALSVDIRTMSSRDLLNALKSVINEPIYKENAM 444 

Qy 439 AASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLG 498 

II I : I I I II : I : : Mil: I : : M Ml 

D b 445 KL S RI HHDQ PMK P LDRAVFW I E FVMRHKGAKHL RVAAHNLTWI Q YH S L DVI AFLLACVAA 504 

Qy 499 TLWLCGK 505 

: : : I 

Db 505 VIFIITK 511 



RESULT 8 
UDBD RABIT 



ID UDBD_RABIT STANDARD; PRT; 531 AA. 

AC P36512; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE UDP-glucuronosyltransferase 2B13 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) (EGT10) . 

GN UGT2B13. 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=New Zealand white; TISSUE=Liver ; 

RX MEDLINE=93315511; PubMed-83258 97 ; 

RA Tukey R.H., Pendurthi U.R., Nguyen N.T., Green M.D., Tephly T.R.; 

RT "Cloning and characterization of rabbit liver UDP- 

RT glucuronosyltransferase cDNAs . Developmental and inducible expression 

RT of 4-hydroxybiphenyl UGT2B13 . " ; 

RL J. Biol. Chem. 268:15260-15266(1993). 

CC -!- FUNCTION: UDPGT IS OF MAJOR IMPORTANCE IN THE CONJUGATION/AND 
CC SUBSEQUENT ELIMINATION OF POTENTIALLY TOXIC XENOBIOTICS AND 

CC ENDOGENOUS COMPOUNDS. ACTS ON SMALL PHENOLIC AGENTS SUCH AS 2- 

CC NAPHTOL AND 4-METHYLUMBELLIFERONE AS WELL AS BULKY PHENOLIC 

CC COMPOUNDS LIKE 2 -HYDROXY- AND 4-HYDROXYBIPHENYL. IN CONTRAST TO 

CC 2B16 IT IS ACTIVE TOWARD OCTYLGALLATE . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside. 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- DEVELOPMENTAL STAGE: Expressed primarily in adult rabbits. 

cc _!_ SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 7 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb~sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC : 

DR EMBL; L01081; AAA18020.1; -. 
DR PIR; B47113; B47113. 



DR 


InterPro; 


IPR002213; UDP_ 


gluco trans . 




DR 


Pfam; PF00201; UDPGT; 1. 






DR 


PROSITE; 


PS00375; 


UDPGT; 


1. 




KW 


Transferase; Glycosyltransf erase; Glycoprotein; 


Transmembrane ; 


KW 


Multigene 


family; 


Microsome . 




FT 


SIGNAL 


1 


24 






FT 


CHAIN 


25 


531 


UDP-GLUCURONOSYLTRANSFERASE 2B13. 


FT 


TRANSMEM 


495 


511 


POTENTIAL. 




FT 


CARBOHYD 


69 


69 


N-LINKED ( GLCNAC . , 


. .) (POTENTIAL) 


FT 


CARBOHYD 


101 


101 


N-LINKED (GLCNAC. 


. .) (POTENTIAL) 


FT 


CARBOHYD 


317 


317 


N-LINKED (GLCNAC. 


. .) (POTENTIAL) 


SQ 


SEQUENCE 


531 AA; 


60552 


MW; 961DA58AC4CB9932 


CRC64; 



Query Match 24.2%; Score 670; DB 1; Length 531; 

.Best Local Similarity 32.6%; Pred. No. 5.4e-45; 

Matches 169; Conservative 90; Mismatches 206; Indels 54; Gaps 14; 



Qy 34 SHYLLMDRVSQI LQDHGHNVTML NHKRG PFMPDFKKEEKSYQVI SW 79 

I I : : I : I I I I I : I I : : I I s : I : I : I 

Db 35 SHWMNMKTILD7VLVQQGHEVTVLRSSASIVIGSNNESGIKFETFHTSYRKDEIENFFMDW 94 

Qy 80 LAPEDHQREFKKS FDFFLE ETLGGRGKFENLLNVLEYLAL QCSHFLNRKDIMDS 133 

: I : : : I I I : : I : I : I : I : I 

Db 95 F YKMIYNVSIESYWETFS LTKMVILKYSDICEDICKEVILNKKLMTK 141 

Qy 134 LKNENFDMVIVETFDYCPFLIAEKLGKP FVAILSTSFGSLEFGLPIPLSYVPV 186 

I : I I : I : : I : I I I I I I : I I I : 

Db 142 LQESRFDWLADPVSPGGELLAELLKIPLVYSLRGFVGYMLQKHGG GLLLPPSYVPV 198 

Qy 187 FRSLLTDHMDFWGRVKNFL — MFFSFCRRQQHMQSTFDNTIKEHFTEGSRPV-LSHLLLK 243 

II I I I I : I I : : I I : : : I I I I I hi 

Db 199 MMSGLGSQMTFMERVQNLLCVLYFDFW-FPKFNEKRWDQFYSEVL GRPVT FLELMGK 254 

Qy 244 AELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSM 303 

I : : I I I : : I I I I I I I : : I I I I I I I : I I : : I : I : I : I I : : I I I I 
Db 255 ADMWLIRSYWDLEFPRPLLPNFDFIGGLHCKPAKPLPQEMEDFVQSSGEEGVWFSLGSM 314 

Qy 304 WTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVHLAANVKIVDWLPQSDLLAHPSI 363 

: : | : : I I I I I I : I : : I I : I : : I : I I : I I I I I 

Db 315 ISNL-TEERANVIASALAQLPQKVLWRFE GKKPDMLGSNTRLYKWI PQNDLLGHPKT 370 

Qy 364 RLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALK 423 

: I : I I I I I : I I I I I : I I I I : I I I I I I : I : I : : I I I : I I : : I 

Db 371 KAFITHGGANGVFEAIYHGIPMVGLPLFGDQLDNIVYMKAKGAAVKLNLKTMSSADLLNA 4 30 

Qy 424 MKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQ 483 

: I : : I II I : II hi I I I h : h : Mil: I : : 

Db 431 LKTVINDPSYKENAMTLSRIHHDQPMKPLDRAVFWIEYVMRHKGAKHLRVAAHDLTWYQY 490 



Qy - 48 4 YLFDVFVFLLGLTLGTLWLCGKLLGMAVWWLRGARKVKE 522 

: II Ml I : I I : : : M I h 

Db 491 HSLDVIGFLLACVAITTYLIVKCCLLVYRYVLGAGKKKK 529 



RESULT 9 
UDll_MOUSE 

ID UD11 MOUSE STANDARD; PRT; 535 AA. 



AC Q63886; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UDP-glucuronosyltransf erase 1-1 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) (UGT1*1) (UGTl-01) (UGTl.l) (UGTlAl) (UGTBR1) . 

GN UGTlAl OR UGTl. 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=93219265; PubMed=84 64825; 

RA Kong A.N., Ma M. , Tao D., Yang L. ; 

RT "Molecular cloning of two cDNAs encoding the mouse bilirubin/phenol 

RT family of UDP-glucuronosyltransf erases (mUGTBr/p) . "; 

RL Pharm. Res. 10:4 61-465(1993). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 
CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside. 

CC SUBCELLULAR LOCATION: Microsomal. 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=l; 

CC Comment=A number of isoforms are produced. The different 

CC isozymes have a different N-terminal domain and a common 

CC C-terminal domain of 245 residues; 

CC Name=l; 

CC IsoId=Q63886-l; Sequence=Displayed; 

CC -!- INDUCTION: By dioxin. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; S64760; AAB26033.2; -. 

DR MGD; MGI: 98898; Ugtlal. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF002 01; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome; Alternative splicing. 

FT SIGNAL 1 29 POTENTIAL. 

FT CHAIN 30 535 UDP-GLUCUR0NOSYLTRANSFERASE 1-1. 

FT TRANSMEM 493 509 POTENTIAL . 

FT CARBOHYD 89 89 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 297 297 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 435 435 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 535 AA; 60123 MW; B5AE3C209979BBB8 CRC64; 



ery Match 24.2%; Score 669; DB 1; Length 535; 

st Local Similarity 31.7%; Pred. No. 6.6e~45; 
atches 174; Conservative 101; Mismatches 216; Indels 58; Gaps 15; 

5 RVLLLVGFLLP GVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRG 61 

hill: :l! ! | I : : | : : Mhl I MM III:: 
9 RLLLLLPYLLLCVFGPYASHAGRLL-VFPMDGSHWLSMLGVIQQLQQKGHEWVI A 63 

62 PFMPDFKKEEKSYQVISWLAPEDHQREFKKSFDFFLEETLGGRGKFEN LLNVLE-YL 117 

I li | : : j |:: I I I I I II I : : I : 

b 64 PEASIHIKEGSFYTLRKFPVPFQ KENVTATLVEL — GRTAFNQDS FLLRWKI YM 116 

y 118 ALQ CSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKP FVAI 165 

: : MM: MM: : I M : M i : : I M I I : 

Db 117 KVKRDSSMLLAGCSHLLHNAEFMASLEESHFDALLTDPFLPCGSIVAQYLTVPTVYFLNK 176 

Qy 166 LSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFLMFFS FCRRQQHMQSTFD 222 

I ! I h! I I I I I ! : I I : I Mill: I M : 

Db 177 LPCSLDSEATQCPVPLSYVPKSLSFNSDRMNFLQRVKNVLLAVSENFMCRWYSPYGSLA 236 

Qy 223 NTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQD 282 

I : : II I :| MM I: IMMI |::||: IM h 

Db 237 TEILQ KEVTVQDLLSPASIWLMRSDFVKDYPRPIMPNMVTIGGINCLQKKPLSQE 291 

Qy 283 LENFIAKFGDSGFVLVTLGSMWTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHWPKDVTiL 342 

I : : I M I : : I I I II : : I : I Mi MM : : : I 
Db 292 FEAYWASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTVLWRYTGT RPSNL 347 

Qy 343 AANVKITOWLPQSDLIAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVE 402 

II : I II M M I M I II : II I M II : II I II M I I I I I : I hi 

Db 348 AKNTILVKWLPQNDLIGHPKTRAFITHSGSHGIYEGICNGVPMViyiMPLFGDQMDNAKRME 407 

Qy 403 AKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHV 462 

: I | : : : : :| M M : : : II I : I : : I M M : : : I 
Db 408 TRGAGVTLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAVFWVEYV 467 

Qy 463 LQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLGMAVWWL 514 

i : I II I : I I : : : M I II M : : : II I 
Db 468 MRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVFKCCAYGCRKCFG 521 

Qy 515 RGARKVKET 523 

I Ml:: 
Db 522 -GKGRVKKS 529 



RESULT 10 
UDB1_RAT 

ID UDB1_RAT STANDARD; PRT; 529 AA. 

AC P09875; 

DT 01-MAR-1989 (Rel. 10, Created) 

DT 01-MAR-1989 (Rel. 10, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B1 precursor, microsomal (EC 2.4.1.17) 
DE (UDPGT) (UDPGTR-2). 
GN UGT2B1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus . 

OX NCBI_TaxID= 10116; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=86196018; PubMed=308 447 9 ; 

t RA McKenzie P. I. ; 

RT "Rat liver UDP-glucuronosyltransf erase . Sequence and expression of a 

RT cDNA encoding a phenobarbital-inducible form."; 

RL J. Biol. Chem. 261:6119-6125(1986). 

RN [2] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=90293083; PubMed=2 113533 ; 

RA McKenzie P.I., Rodbourn L. ; 

RT "Organization of the rat UDP-glucuronosyltrans f erase, UDPGTr-2, gene 

RT and characterization of its promoter."; 

RL J. Biol. Chem. 265:11328-11332(1990). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 
CC subsequent elimination of ' potentially toxic xenobiotics and 

CC endogenous compounds . 

cc _i_ CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- INDUCTION: By phenobarbital . 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 7~~ 

CC This SWISS-PROT entryc is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; M13506; AAA42313.1; -. 

DR EMBL; M35086; AAA42310.1; -. 

DR EMBL; M35202; AAA42310.1; JOINED. 

DR EMBL; M35080; AAA42310.1; JOINED. 

DR EMBL; M35082; AAA42310.1; JOINED. 

DR EMBL; M35083; AAA42310.1; JOINED. 

DR PIR; A42233; A42233. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltrans f erase ; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome. 

FT SIGNAL 1 23 BY SIMILARITY. 

FT CHAIN 24 529 UDP-GLUCURONOSYLTRANSFERASE 2B1. 

FT TRANSMEM 494 510 POTENTIAL. 

FT CARBOHYD 134 134 N-LINKED ( GLCNAC . . .) (POTENTIAL). 

FT CARBOHYD 316 316 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 529 AA; 60484 MW; 14DF0224BF1C3044 CRC64; 

Query Match 24.1%; Score 668; DB 1; Length 529; 

Best Local Similarity 32.2%; Pred. No. 7.8e-45; 

Matches 175; Conservative 89; Mismatches 229; Indels 50; Gaps 15; 



Qy 6 VLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKRGPFMP 65 

: : I : : I I I : I I I I : : : : I I I I I : I : 

Db 11 LIQLICYFRPGA CGKVLVWPT-EYSHWINIKIILNELAQRGHEVTVLVSSASILIE 65 

Qy 66 DFKKEEKSYQVI SW-LAPEDHQREFKKSFDFFLE — ETLG GRGKFENLLN VLE 115 

I : : : : : | | : I : I I | : III I : : I I M 

Db 66 PTKESSINFEIYSVPLSKSDLEYSFAKWIDEWTRDFETLSIWTYYSKMQKVFNEYSDWE 125 

Qy 116 YLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFG 171 

I I : | : | I : II:::: I I : I I I I I I I 

Db 126 NL CKALIWNKSLMKKLQGSQFDVILADAVGPCGELLAELLKTPLVYSLRFCPGYRCE 182 

Qy 172 SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL — MFFSFCRRQQHMQSTFDNTIKEHF 22 9 

111:1 Mill I hi II MM I ::! I I : : : : 

Db 183 KFSGGLPLPPSYVPWLSELSDRMTFVERVKNMLQMLYFDF WFQPFKEKSWSQFY 237 

Qy 230 TE— GSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPIKPVPQDLENFI 287 

: : | | : : : | | : : | | : : : | I I I I : II II I I I I : I : : : I I : 

Db 238 SDVLGRPTTLTEMMGKADIWLIRTFWDLEFPHPFLPNFDFVGGLHCKPAKPLPREMEEFV 297 

Qy 288 AKFGDSGFVLWLGSMWTCQNPEIFKEMNNAFAH^ 347 

| : | | : : I I I I I I : M I Ml hM I I M : 

Db 298 ..QSSGEHGVVVFSLGSMVKNL-TEEKANVVASAIiAQIP.QKVVWRFD- — GKKPDTLGSNTR 353 

Qy 348 IVDWLPQSDLLAHPSIRLFWHGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFG 407 

: I : M : II I I I : I I I I I I I I I I II M M I I II I I II M : : II 
Db 354 LYKWIPQNDLLGHPKTKAFVAHGGTNGIYEAIYHGIPIVGIPLFADQPDNINHMVAKGAA 413 

Qy 408 VSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGG 467 

I : | I MM I II I : II 1:1 I I I I : M : : I 
Db 414 VRVDFSILSTTGLLTALKIVMNDPSYKENAMRLSRIHHDQPVKPLDRAVFWIEYVMRHKG 47 3 

Qy 468 ATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGT LWLCGKLLGMAVWWLRGARK 519 

III:: I :: I I II I M MM I M 
Db 474 AKHLRSTLHDLSWFQYHSLDVIGFLLLCWGWFIITKFCLFCCRKTANM GKK 526 

Qy 520 VKE 522 

I I 

Db 527 KKE 529 



RESULT 11 
UD15_RAT 

ID UD15JRAT STANDARD; PRT; 531 AA. 

AC Q64638; 

DT 01-NOV-1997 (Rel. 35, Created) 

DT 01-NOV-1997 (Rel. 35, Last sequence update) 

DT 28-FEB-2003 (Rel. 41, Last annotation update) 

DE UDP-glucuronosyltransf erase 1-5 precursor, microsomal (EC 2.4.1.17) 
DE (UDPGT) (UGT1*5) (UGT1-05) (UGT1.5) (UGT1A5) (B5). 

GN UGT1. 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 
RN [1] 

RP SEQUENCE OF 1-286 FROM N.A. 



RC STRAIN=Wistar; 

RX MEDLINE=95332265; PubMed-7 608130 ; 

RA Emi Y., Ikushiro S.I., Iyanagi T . ; 

RT "Drug-responsive and tissue-specific alternative expression of 

RT multiple first exons in rat UDP-glucuronosyltransf erase family 1 

RT (UGT1) gene complex."; 

RL J. Biochem. 117:392-399(1995). 

RN [2] 

RP SEQUENCE OF 287-531 FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=90274676; PubMed-2112380; 

RA Sato H., Koiwai 0., Tanabe K. , Kashiwamata S.; 

RT "Isolation and sequencing of rat liver bilirubin UDP- 

RT glucuronosyltransferase cDNA: possible alternate splicing of a common 

RT primary transcript."; 

RL Biochem. Biophys . Res. Commun. 169:260-264(1990). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 
CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside. 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC - ! ~ ALTERNATIVE PRODUCTS : 

CC Event^Alternative splicing; Named isoforms=l; 

CC Comment=A number of isoforms may be produced. Isoforms have a 

CC different N-terminal domain and a common C-terminal domain of 

CC 245 residues; 

CC Name=l; 

CC IsoId=Q64 638-1; Sequence=Displayed; 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; D38069; BAA07263.1; -. 

DR EMBL; M34007; AAA42312.1; ALT_TERM. 

DR InterPro; IPR002213; UDP__gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE;. PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome; Alternative splicing. 

FT SIGNAL 1 25 POTENTIAL. 

FT CHAIN 26 531 UDP -GLUCURONOSYLTRANSFERASE 1-5. 

FT TRANSMEM 489 505 POTENTIAL. 

FT CARBOHYD 116 116 N-LINKED ( GLCNAC . . .) (POTENTIAL). 

FT CARBOHYD 131 131 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 139 139 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 293 293 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 431 431 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 531 AA; 59993 MW; 04 14 8C1BA6CAAC8 0 CRC64; 



Query Match 



24.0%; Score 665; DB 1; Length 531; 



Best Local Similarity 30.5%; Pred. No. 1.3e-44; 

Matches 169; Conservative 102; Mismatches 216; Indels 68; Gaps 15; 



Qy 1 MAGQRVLLLVGFLLPGVLLSEAAKILTISTVGGSHYLLMDRVSQILQDHGHNVTMLNHKR 60 

: I I I I : : I I : I I : I : : I I I : I I I : I II ' I 

Db 7 LQGLAGLLLLLYALP WAEGGKVL-VFPMEGSHWLSMRDWRELHARGHQAWL 58 

Qy 61 GPFMPDFKKEEKSYQVISWLAPEDHQ REFKKSFD FFLEETLGGRGKF 107 

I : | | | : : : : I | III: III: 
Db 59 APEVTVHIKEEDFFTLQTYPVPYTRQGFRQQMMRNIKWFETGNYVKTFLETS 111 

Qy 108 ENLLNVLEYLALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKP FVA 164 

III: I I : I : : : | : : I I | I : : I : : I : I I I I : 
Db 112 EILKNISTVLLRSCMNLLHNGSLLQHLNSSSFDMVLTDPVIPCGQVL7VKYLGIPTVFFLR 171 

Qy 165 ILSTSFGSLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKN FLMFFS FCRRQQH 216 

I I I I I : I :: I : I I I I I I I I : : I I I I : 

Db 172 YIPCGIDSEATQCPKPSSYIPNLLTMLSDHMTFLQRVKNMLYPLALKYICHFSFTRYESL 231 

Qy 217 MQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDFARPLLPNTVYVGGLMEKPI 27 6 

: I I I I I I : I I I I I : I I : : I I I : : I I : 

Db 232 ASELLQREVS — LVE VLSH ASVWLFRGDFVFDYPRPVMPNMVFI GGINCVI K 281 

Qy 277 KPVPQDLENFIAKFGDSGFVXVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHW 336 

I I : I : I : : I : I I : : I I I I I : : I : I : I I : : I : : 

Db 282 KPLSQEFEAYVNASGEHGIWFSLGSMVSEIPEKKAM-EIAEALGRIPQTLLWRYTGT — 338 

Qy 337 PKDVlilAANWIVlMLPQSDLLAHP 396 

: : I I I : I I I I I : I I I I I I I : I I I : I I I : I I I I I : I I I I I I : 

Db 339 -RPSNIAKNTILVKWLPQNDLLGHPKARAFITHSGSHGIYEGICNGVPMVMMPLFGDQMD 397 

Qy 397 NMTOVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLV 456 

I I : I : I I : : : : : I : I : I : : : I I I : I : : I : I I 
Db 398 N7U<RMETRGAGVTLNVLEMTADDLENALKTVINNKSYKENIMRLSSLHKDRPIEPLDLAV 457 

Qy 457 GWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL CGKLLG 508 

I : : : I : : I I I I : I I : : : I I I I I : I : : : III 

Db 458 FWVEYVMRHKGAPHLRPAAHDLTWYQYHSLDVIGFLLAIVLTWFIVYKSCAYGCRKCFG 517 

Qy 509 MAVWWLRGARKVKET 523 

I : I I 

Db 518 GKGRVKKS 525 



RESULT 12 
UDBJ_MACFA 

ID UDBJ_MACFA STANDARD; PRT; 528 AA. 

AC Q9XT55; 

DT 30-MAY-2000 (Rel. 39, Created) 

DT 30-MAY-2000 (Rel. 39, Last sequence update) 

DT 16-OCT-2001 (Rel. .40, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B19 precursor, microsomal (EC 2.4.1.17). 

GN UGT2B19. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae ; 

OC Cercopithecinae; Macaca. 



OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver, and Prostate; 

RX MEDLINE=99203465; PubMed=l 0102 998 ; 

RA Belanger G., Barbier O., Hum D.W. , Belanger A.; 

RT "Molecular cloning, expression and characterization of a monkey 

RT steroid UDP-glucuronosyltransf erase, UGT2B19, that conjugates 

RT testosterone."; 

RL Eur. J. Biochem. 260:7 01-7 08(1999). 

CC -!- FUNCTION: UDPGT IS OF MAJOR IMPORTANCE IN THE CONJUGATION AND 

CC SUBSEQUENT ELIMINATION OF POTENTIALLY TOXIC XENOBIOTICS AND 

CC ENDOGENOUS COMPOUNDS. THIS ISOZYME DISPLAYS ACTIVITY TOWARD 

CC SEVERAL CLASSES OF XENOBIOTIC SUBSTRATES: EUGENOL, 4- 

CC METHYLLUMBELLI FERONE , P-NITROPHENOL, 1-NAPHTHOL, P, P 1 -BIPHENOL, 

CC NARINGENIN AND O, O 1 -BIPHENOL . ACTIVE ALSO ON 3A-HYDROXY AND 17 B- 

CC HYDROXY POSITIONS OF STEROIDS. 

CC -!- FUNCTION: CONTRIBUTES TO THE FORMATION OF ANDROGEN GLUCURONIDE IN 
CC EXTRAHEPATIC STEROID TARGET TISSUES SUCH AS THE PROSTATE. 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP -f acceptor 
CC beta-D-glucuronoside . 

CC -!~ SUBCELLULAR LOCATION: Microsomal (By similarity). 

CC -!- TISSUE- SPECIFICITY: EXPRESSED IN . LIVER, OVARY, PROSTATE, COLON, 
CC KIDNEY, PANCREAS, BRAIN, CEREBELLUM, MAMMARY GLAND AND EPIDIDYMIS. 

CC NOT EXPRESSED IN SMALL INTESTINE, SPLEEN, BLADDER, ADRENAL GLAND 

CC AND TESTIS. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib.ch). 

CC 

DR EMBL; AF112112; AAD24435.1; -. 

DR InterPro; IPR002213; UDP_gluco_trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome. 

FT SIGNAL 1 21 POTENTIAL. 

FT CHAIN 22 528 UDP-GLUCURONOSYLTRANSFERASE 2B19. 

FT TRANSMEM 493 513 POTENTIAL. 

FT CARBOHYD 315 315 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 528 AA; 60741 MW; 3BFD2AE7 14A27AEE CRC64; 

Query Match 24.0%; Score 664.5; DB 1; Length 528; 

Best Local Similarity 33.2%; Pred. No. 1.5e-44; 

Matches 168; Conservative 89; Mismatches 218; Indels 31; Gaps 12; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG PFMPDFKKEEKSYQVISWLAPEDHQREF 89 

||:::: I I I I I : I : II II : : : I : 

Db 34 SHWMNIKTILDELVQRGHEVTVLAYSTSILPDPNNPSPLKFEICPTSLTETEFQDSVTQL 93 

Qy 90 KKS FDFFLEETLGGRGKFENLLNVLEYLALQ CSHFLNRKDIMDSLKNENFDMV 142 



I : ::| : : |:| I : I :: I :| |: Ihl 

Db 94 VKRWSDIRKDTF WPHFLHVQEMMWTYGDMIRKFCKDWSNKKLMKKLQESRFDW 148 

Qy 143 IVETFDYCPFLIAEKLGKPFVAILSTSFG-SLE FGLPI PLSYVPVFRSLLTDHMDFW 198 

: : I I : I I I I I I I I I : I I I I I I I I I I I I I I 

Db 149 LADAISPCGELLAELLKIPFVYSLRFSPGYALEKHGGGFLFPPSYVPVTMSELRDQMTFM 208 

Qy 199 GRVKN--FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFD 25 6 

I i : I : : : : I I : : : : : I : I I : : I I I : I I : : I 

Db 209 ERVQNMIYMVYFDFWFQWDVKN-WDQFYSK — VLGRPTTLFEIMAKAEIWLIRNYWDFQ 265 

Qy 257 FARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMVNTCQNPEIFKEM 316 

I I I I I I : I I I I | | | | : I : : : I I : I I : I I : : I I I I I : : I : 
Db 266 FPHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGDNGVWFSLGSMVSN-MSEERANVI 324 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVHLAANVKI VDWLPQSDLLAHPS I RLFVTHGGQNS IM 376 

: I I : I I I : I : : I I I :: I I I I : I I I I I I I : I I I I I I 
Db 325 ASALAKI PQKVLWRFDGN KPDTLGLNTQLYKWLPQNDLLGHPKTRAFITHGGANGIY 381 

Qy 377 EAIQHGVPMVGIPLFGDQPENMVWEAKKFGVSIQLKKLKAETu^ 436 

I I I II : I I I I : I I I I I I : I : : : I I I : : : I : | : : | II 
Db 382 EAI YHG I PMVGVP L FADQ P DN I AHMKAKGAAVRLD FDTMS S T DLLNALKT VI ND P I YKEN 441 

Qy 437 AVAASVI LRSHPLS PTQRLVGWI DHVLQTGGATHLKP YVFQQPWHEQYLFDVFVFLLGLT 4 96 

I : II 1:1 lllhl:: I I I I : I : : II III 

Db 442 AMKLS S I HHDQPVKPLDRAVFWI EFVMRHKGAKHLRVAAHDLTWFQYHSLDVI GFLLACV 501 

Qy 497 LGTLWLCGKLLGMAW-WLRGARKVK 521 

::: II I I :: I : I I 

Db 502 ATVI FI ITKCL- FCVWKFVRTRKKGK 526 



RESULT 13 
UDB9_MACFA 

ID UDB9_MACFA STANDARD; PRT; 52 9 AA. 

AC 002663; 

DT 15-DEC-1998 (Rel. 37, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B9 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) . 

GN UGT2B9. 

OS Macaca fascicularis (Crab eating macaque) (Cynomolgus monkey) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Cercopithecidae; 

OC Cercopithecinae; Macaca. 

OX NCBI_TaxID=9541; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC TISSUE-Prostate; 

RX MEDLINE=98030212; PubMed=9364 930; 

RA Belanger G . , Beaulieu M. , Levesque E., Hum D.W., Belanger A. ; 

RT "Expression and characterization of a novel 

RT UDP-glucuronosyltransferase, UGT2B9, from cynomolgus monkey."; 

RL DNA Cell Biol. 16:1195-1205(1997). 

CC -!- FUNCTION: UDPGT IS OF MAJOR IMPORTANCE IN THE CONJUGATION AND 
CC SUBSEQUENT ELIMINATION OF POTENTIALLY TOXIC XENOBIOTICS AND 



CC ENDOGENOUS COMPOUNDS. THIS ISOZYME IS ACTIVE ON C18, C19, AND C21 

CC STEROIDS, BILE ACIDS, AND SEVERAL XENOBIOTICS INCLUDING EUGENOL, 

CC 1-NAPHTHOL, AND P-NITROPHENOL . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U91582; AAB50249.1; 

DR InterPro; IPR002213; UDP_gluco__trans . 

DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Bile acid catabolism; Transferase; Glycosyltransf erase; Microsome; 

KW Signal; Transmembrane; Glycoprotein; Multigene family. 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 

i 

Best Local Similarity 32.9%; Pred. No. 1.8e-44; 

Matches 169; Conservative 78; Mismatches 220; Indels 47; Gaps 13; 

SHYLLMDRVSQI LQDHGHNVTML NH KRGPFMPDFKKEE KSYQ 75 

I I : : I : : I I I I I : I I : I I II : 



I I : I : I : | : : ■ : | : : : I : : I : I I 

VKRWIELPKD TFWLYFSQMQEIMWRFGDI I RNF CKDWSNKKLMKKL 140 

KNENFDMVIVETFDYCPFLIAEKLGKPFVAILSTSFGSLEF GLPI PLSYVPVFRS 189 

: I I : I : I I : I I II I : I : I I I I I I I I I 



I : I I I I I I I : II I : : I I I | | : | I : : I 
ILSDOMTFMERVKNMIYMLSFDFYFQMYDMKKWDQFYSE — VLGRPTTLSETMGKADIWL 257 



SIGNAL 


1 


21 


POTENTIAL. 


CHAIN 


22 


529 


UDP-GLUCURONOSYLTRANSFERASE 2B9. 


TRANSMEM 


494 


514 


POTENTIAL. 


CARBOHYD 


67 


67 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


CARBOHYD 


68 


68 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


CARBOHYD 


88 


88 


N-LINKED (GLCNAC. . .) (POTENTIAL). 


! SEQUENCE 


529 AA; 


60970 


MW; F110F85FE3A9DB8D CRC64; 


Query Match 




24. O 5 ; 


i; Score 663.5; DB 1; Length 529; 



Qy 


34 


Db 


34 


Qy 


76 


Db 


94 


Qy 


135 


Db 


141 


Qy 


190 


Db 


200 


Qy 


249 


Db 


258 


Qy 


309 


Db 


317 



II I I I I I : II I I I I I I : I : :■: I I : I : : i I : : I I I I I I 



I I : I I I : I : I I I : : I : I I : I I I I I I 



Qy 369 HGGQNSIMEAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIM 428 

I I I I I I I I I I I I I I I I I I I I I I : I : : : I I : : : I I : : I : : 
Db 374 HGGANGIYE^IYHGVPMVGIPLFADQPDNIAHMKTKGAAVRLDFDTMSSTDLANRLKTVI 433 

Qy 429 EDKRYKSAAVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDV 4 88 

III : I I I : I I I I I : I : : I I I I : I | : : I I 

Db 434 NDPLYKENVMKLSRIQHDQPVKPLDRAVFWIEFVMRHKGAKHLRPAAHDLTWFQYHSLDV 493 



Qy 4 89 FVFLLGLTLGTLWLCGKLLGMAWWLRGARKVKE 522 

III : : : I I : I I I I : 

Db 494 IGFLLACVATVI FVIMKCCLFCFW — KFARKGKK 525 



RESULT 14 
UDBE RABIT 



ID UDBE_RABIT STANDARD; PRT; 530 AA. 

AC P36513; 

DT 01-JUN-1994 (Rel. 29, Created) 

DT 01-JUN-1994 (Rel. 29, Last sequence update) 

DT 16-OCT-2001 (Rel. 40, Last annotation update) 

DE UDP-glucuronosyltransf erase 2B14 precursor, microsomal (EC 2.4.1.17) 

DE (UDPGT) (EGT12). 

GN UGT2B14 . 

OS Oryctolagus cuniculus (Rabbit) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Lagomorpha; Leporidae; Oryctolagus. 

OX NCBI_TaxID=9986; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=New Zealand white; TISSUE=Liver ; 

RX MEDLINE=93315511; PubMed-8325897 ; 

RA Tukey R.H., Pendurthi U.R., Nguyen N.T., Green M.D., Tephly T.R.; 

RT "Cloning and characterization of rabbit liver UDP- 

RT glucuronosyltransf erase cDNAs . Developmental and inducible expression 

RT of 4-hydroxybiphenyl UGT2B13 . " ; 

RL J. Biol. Chem. 268:15260-15266(1993). 

CC -!- FUNCTION: UDPGT is of major importance in the conjugation and 
CC subsequent elimination of potentially toxic xenobiotics and 

CC endogenous compounds . 

CC -!- CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 

CC beta-D-glucuronoside . 

CC -!- SUBCELLULAR LOCATION: Microsomal. 

CC -!- DEVELOPMENTAL STAGE: Expressed primarily in adult rabbits. 

CC -!- SIMILARITY: Belongs to the UDP-glycosyltrans f erase family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; L01082; AAA18021.1; -. 

DR PIR; C47113; C47113. 

DR InterPro; IPR002213; UDP_gluco_trans . 



DR Pfam; PF00201; UDPGT; 1. 

DR PROSITE; PS00375; UDPGT; 1. 

KW Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; Signal; 

KW Multigene family; Microsome. 

FT SIGNAL 1 24 BY SIMILARITY. 

FT CHAIN 25 530 UDP-GLUCURONOSYLTRANSFERASE 2B14 . 

FT TRANSMEM 494 510 POTENTIAL. 

FT CARBOHYD 134 134 N-LINKED (GLCNAC. . .) (POTENTIAL). 

FT CARBOHYD 316 316 N-LINKED (GLCNAC. . .) (POTENTIAL). 

SQ SEQUENCE 530 AA; 60664 MW; CEAA4BF25B53CB35 CRC64; 

Query Match 23.9%; Score 662.5; DB 1; Length 530; 

Best Local Similarity 34.5%; Pred. No. 2.1e-44; 

Matches 161; Conservative 75; Mismatches 209; Indels 21; Gaps 10; 

Qy 50 GHNVTMLNHKRGPFMPDFKKEEKSYQVI SWLAPEDHQRE-FKKSFDFFLEETLGGRGKFE 108 

I I I : I : I : I : : : I : I : I : : I : 

Db 51 GHEVIVLRNSASIFIDPSKQANIKFETFPIAATKDDLEDLFVHYVSTWTNARQNSQWKYF 110 

Qy 109 NLLNVL— EY LALQCSHFLNRKDIMDSLKNENFDMVIVETFDYCPFLIAEKLGKPFV 163 

: I I I I I I : I : I I : I I : : : : I I : I I I I I I 

Db 111 SLLQKLFSEYSDSCENACKEWFNKTLMTKLQESRFDILLSDAIGPCGELLAELLKIPFV 170 

Qy 164 AILSTSFG SLEFGLPIPLSYVPVFRSLLTDHMDFWGRVKNFL— MFFSFCRRQQHM 217 

I : I I I : I I I I I : I I : I I I I I I :: I I I 

Db 171 Y S L RFT P G YTME K YS G GL S VP P S YVP 1 1 L S DL S GKMT FME RVNNML CML Y FD FW - FQMFN 229 

Qy 218 QSTFDNTIKEHFTEGSRPV-LSHLLLKAELWFINSDFAFDF7VRPLLPNTVYVGGLMEKPI 276 

: : I I III I I : I I : : I I I : : I I I I I I : I I I I II 

Db 230 KKRWDQFYS EVL GRPVTFSELVGKADMWLIRSYWDLEFPRPTLPNIQFVGGLHCKPA 28 6 

Qy 277 KPVPQDLENFIAKFGDSGFVTiVTLGSMVNTCQNPEIFKEMNNAFAHLPQGVIWKCQCSHW 336 

I I : I : : : I I : | : j | : : | | | | | : | :: I I I I I I I I I : 
Db 287 KPLPKEMEEFVQSSGEEGVWFSLGSMVSN-MTEERANLIASAFAQLPQKVIWRFD G 342 

Qy 337 PKDVliLAANVKIVDWLPQSDLLAHPSIRLFVTHGGQNSIMEAIQHGVPMVGIPLFGDQPE 396 

I I I : I I I : I I : I I I I I : I I I I I I I I I I I I I : I I I I : I I I I : I I : 

Db 343 QKPETLGPNTRI YDWI PQNDLLGHPKTKAFVTHGGANGI YEAIHHGI PMVGLPLFGEQPD 402 

Qy 397 NMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSAAVAASVILRSHPLSPTQRLV 456 

I : : I I : : I : : I I : I : : I | | : I I I : I II 

Db 403 NIAHMTAKGAAI RLNWKTMSSEDLLNALKTVINDPSYKENVMTLSSIHHDQPMKPLDRAV 462 

Qy 457 GWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLTLGTLWL 502 

I I : : I : : II II: I : : II II: : : I 

Db 463 FWIEYVMRHKGAKHLRVAAHDLTWFQYHSLDWGFLVSCAAFLIFL 508 



RESULT 15 
UDB4_HUMAN 

ID UDB4__HUMAN STANDARD; PRT; 528 AA. 

AC P06133; 060731; 060867; 075614; P36538; 

DT 01-JAN-1988 (Rel. 06, Created) 

DT 15-DEC-1998 (Rel. 37, Last sequence update) 

DT 10-OCT-2003 (Rel. 42, Last annotation update) 

DE UDP-glucuronosyltransferase 2B4 precursor, microsomal (EC 2.4.1.17) 
DE (UDPGT) (Hyodeoxycholic acid) (HLUG25) (UDPGTH-1) . 



GN UGT2B4 OR UGT2B11. 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Liver; 

RX MEDLINE=87241362 ; PubMed-3109396; 

RA Jackson M.R., McCarthy L.R., Harding D., Wilson S., Coughtrie M.W.H., 

RA Burchell B. ; 

RT "Cloning of a human liver microsomal UDP-glucuronosyltransf erase 

RT cDNA. 11 ; 

RL Biochem. J. 242:581-588(1987). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE=93326164; PubMed=83338 63 ; 

RA Jin C.-J., Miners J.O., Lillywhite K.J., McKenzie P.I.; 

RT "cDNA cloning and expression of two new members of the human liver 

RT UDP-glucuronosyltransf erase 2B subfamily-"; 

RL Biochem. Biophys . Res. Commun. 194:496-503(1993). 

RN [3] 

RP SEQUENCE FROM N.A. , VARIANT GLU-458, AND CHARACTERIZATION. 

RX MEDLINE=99303261; PubMed=10376768 ; 

RA Levesque E., Beaulieu M. , Hum D.W., Belanger A.; 

RT "Characterization and substrate specificity of UGT2B4 (E458): a 

RT UDP-glucuronosyltransf erase encoded by a polymorphic gene."; 

RL Pharmacogenetics 9:207-216(1999). 

RN [4] 

RP SEQUENCE FROM N.A. , AND VARIANTS LEU-109 AND LEU-396. 

RA McKenzie P. I . ; 

RL Submitted (JUL-1998) to the EMBL/ GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RA Riedy M. , Miller A.; 

RT "Genomic organization and structure of the UGT2B gene complex at human 

RT chromosome 4ql3." ; 

RL Submitted (MAR-1999) to the EMBL/ GenBank/DDBJ databases. 

RN [6] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Liver; 

RX MEDLINE-22388257; PubMed-12477932 ; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R. D . , Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H. , Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T . E . , 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N. A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J. , Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W.,- 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X. , Gibbs R.A., 

RA Fahey J. , Helton E. , Ketteman M. , Madan A., Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E. D. , Dickson M.C., 
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Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 
Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D.E., 
Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 
"Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences."; 

Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 



- i - 



FUNCTION: UDPGTs are of major importance in the conjugation and 
subsequent elimination of potentially toxic xenobiotics and 
endogenous compounds . This isozyme is active on polyhydroxylated 
estrogens (such as estriol, 4-hydroxyestrone and 2-hydroxyestriol ) 
and xenobiotics (such as 4-methylumbellif erone, 1-naphthol, 4- 
nitrophenol, 2-aminophenol , 4-hydroxybiphenyl and menthol) . It is 
capable of 6 alpha-hydroxyglucuronidation of hyodeoxycholic acid. 
CATALYTIC ACTIVITY: UDP-glucuronate + acceptor = UDP + acceptor 
beta-D-glucuronoside . 
SUBCELLULAR LOCATION: Microsomal. 

SIMILARITY: Belongs to the UDP-glycosyltransf erase family. 
CAUTION: REF.2 THOUGHT THAT THIS WAS A NEW FORM (UGT2B11). THE 
NAME UGT2B11 HAS NOW BEEN REUSED FOR ANOTHER HUMAN ENZYME. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
.the. European Bioinf ormatics . .Institute . There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib.ch). 



EMBL; 
EMBL; 



Y00317; CAA68415.1; -. 
AF064200; AAC95002.1; 
EMBL; AJ005162; CAA06396.1; 
EMBL; AF081793; AAC32272.1; 
EMBL; AF135416; AAF78145.1; 
EMBL; BC026264; AAH26264.1; 
PIR; JN0619; JN0619. 
Genew; HGNC: 12553; UGT2B4. 
MIM; 600067; -. 

GO; GO: 0005792; C:microsome; NAS. 

GO; GO: 0006711; P:estrogen catabolism; IDA. 

GO; GO:0006805; P:xenobiotic metabolism; IDA. 

InterPro; IPR002213; UDP_gluco_trans . 

Pfam; PF00201; UDPGT; 1. 



DR 


PROSITE; 


PS00375; 


UDPGT; 1. 




KW 


Transferase; Glycosyltransf erase; Glycoprotein; Transmembrane; 


KW 


Multigene 


family; 


Microsome; 


Polymorphism. 


FT 


SIGNAL 


1 


23 


BY SIMILARITY. 


FT 


CHAIN 


24 


528 


UD P - GLUCU RONO SYLT RAN SFE RASE 2B4 . 


FT 


TRANSMEM 


493 


509 


POTENTIAL. 


FT 


CARBOHYD 


315 


315 


N-LINKED (GLCNAC. . .) (POTENTIAL) 


FT 


VARIANT 


109 


109 


F -> L. 


FT 








/ FTId=VAR 011328. 


FT 


VARIANT 


396 


396 


F -> L. 


FT 








/FTId-VAR_011329. 


FT 


VARIANT 


458 


458 


D -> E. 


FT 








/FTId=VAR_007712 . 


FT 


CONFLICT 


171 


172 


SL -> RP (IN REF. 1) . 


FT 


CONFLICT 


291 


293 


EME -> KWK (IN REF. 4) . 



FT CONFLICT 382 387 EAIYHG -> KAISPR (IN REF. 1). 

SQ SEQUENCE 528 AA; 60512 MW; 6B45E6769971A078 CRC64; 

Query Match 23.9%; Score 661.5; DB 1; Length 528; 

Best Local Similarity 34.2%; Pred. No. 2.5e-44; 

Matches 173; Conservative 87; Mismatches 215; Indels 31; Gaps 15; 

Qy 34 SHYLLMDRVSQILQDHGHNVTMLNHKRG PFMPDFKKEEKS YQVI SWLAPEDHQREF 89 

II:::: I I I I I : I II II : : I I : : 

Db 34 S HWMN I KT I L D ELVQRGH EVT VLAS S AS ISFDPNSPS T LK FEVY P VS LT KT E FE D 1 1 KQL 93 

Qy 90 KKSFDFFLEETLGGRGKFENLLNVL EYLALQCSHFLNRKDIMDSLKNENFDMVIVE 145 

I : : : I I : : : : I I : : I : I I : I I : I : : 

Db 94 VKRWAELPKDTFW — SYFSQVQEIMWTFNDILRKFCKDIVSNKKLMKKLQESRFDWLAD 151 

Qy 146 TFDYCPF— LIAEKLGKPFVAILSTSFG-SLE FGLPIPLSYVPVFRSLLTDHMDFWG 199 

II I : I I I III I I I : : I II I I I I I I I hi I 

Db 152 A — VFPFGELLAELLKIPFVYSLRFSPGYAIEKHSGGLLFPPSYVPWMSELSDQMTFIE 209 

Qy 200 RVKN — FLMFFSFCRRQQHMQSTFDNTIKEHFTEGSRPVLSHLLLKAELWFINSDFAFDF 257 

I I I I : : : : I I : I : : I I I II : I I : : I I : : I I 

Db 210 RVKNMIYVLYFEFWFQIFDMKK-WDQFYSE — VLGRPTTLSETMAKADIWLIRNYWDFQF 266 

Qy 258 ARPLLPNTVYVGGLMEKPIKPVPQDLENFIAKFGDSGFVLVTLGSMV-NTCQNPEIFKEM 316 

I I I I I : I I I I I I I I : I : : : I I : I : : I I : : I I II I I I : I : 
Db 267 PHPLLPNVEFVGGLHCKPAKPLPKEMEEFVQSSGENGVWFSLGSMVSNTSE — E RAN VI 324 

Qy 317 NNAFAHLPQGVIWKCQCSHWPKDVliLAANvTaVDWLPQ 376 

: I I : II I : I : : I I I : : I : I I : I I I I I I I : I I I I I I 
Db 325 ASALAKI PQKVLWRFDGN KPDTLGLNTRLYKWIPQNDLLGHPKTRAFITHGGANGIY 381 

Qy 377 EAIQHGVPMVGIPLFGDQPENMVRVEAKKFGVSIQLKKLKAETLALKMKQIMEDKRYKSA 436 

I I I II : I I I I : I I I I I I : I : : : I I II: : : I : I : : I II 
Db 382 EAIYHGIPMVGVPLFADQPDNIAHMK7VKG7^VSLDFHTMSSTDLLN7VLKTVINDPLYKEN 441 

Qy 437 AVAASVILRSHPLSPTQRLVGWIDHVLQTGGATHLKPYVFQQPWHEQYLFDVFVFLLGLT 496 

I: II I: I I I II: I:: MM: I : : II III 

Db 442 AMKLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRVAAHDLTWFQYHSLDVTGFLLACV 501 

Qy 4 97 LGTLWLCGKLLGMAVW-WLRGARKVK 521 

: : : II I I : : I : I I 
Db 502 ATVIFIITKCL-FCVWKFVRTGKKGK 526 



Search completed: May 7, 200$, 17:31:50 
Job time : 19 sees 



